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CLONING OF CYTOCHROME P450 GENES FROM NICOTIANA 

The present invention relates to nucleic acid sequences 
encoding cytochrome P450 enzymes (hereinafter referred to as 
P450 and P450 enzymes) in Nicotiana plants and methods for 
using those nucleic acid sequences to alter plant phenotypes. 

BACKGROUND 

Cytochrome P450s catalyze enzymatic reactions for a 
diverse range of chemically dissimilar substrates that include 
the oxidative, peroxidative and reductive metabolism of 
endogenous and xenobiotic substrates. In plants, P450s 

participate in biochemical pathways that include the synthesis 
of plant products such as phenylpropanoids, alkaloids, 
terpenoids, lipids, cyanogenic glycosides, and glucosinolates 
(Chappel, Annu. Rev. Plant Physiol. Plant Mol . Biol. 198, 
49:311-343) - Cytochrome P450s, also known as P450 heme- 
thiolate proteins, usually act as terminal oxidases in multi- 
component electron transfer chains, called P450- containing 
monooxygenase systems. Specific reactions catalyzed include 
demethylation, hydroxylation, epoxidation, N-oxidation, 
sulfooxidation, N-, S-, and 0- dealkylations, desulfation, 
deamination, and reduction of azo, nitro, and N-oxide groups. 

The diverse role of Nicotiana plant P450 enzymes has been 
implicated in effecting a variety of plant metabolites such as 
phenylpropanoids, alkaloids, terpenoids, lipids, cyanogenic 
glycosides, glucosinolates and a host of other chemical 
entities. During recent years, it is becoming apparent that 
some P450 enzymes can impact the composition of plant 
metabolites in plants. For example, it has been long desired 
to improve the flavor and aroma of certain plants by altering 
its profile of selected fatty acids through breeding; however 
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very little is known about mechanisms involved in controlling 
the levels of these leaf constituents. The down regulation of 
P450 enzymes associated with the modification of fatty acids 
may facilitate accumulation of desired fatty acids that provide 

5 more preferred leaf phenotypic qualities. The function of P450 
enzymes and their broadening roles in plant constituents is 
still being discovered. For instance, a special class of P450 
enzymes was found to catalyze the breakdown of fatty acid into 
volatile C6- and C9-aldehydes and -alcohols that are major 

10 contributors of "fresh green" odor of fruits and vegetables. 
The level of other novel targeted P450s may be altered to 
enhance the qualities of leaf constituents by modifying lipid 
composition and related break down metabolites in Nicotiana 
leaf. Several of these constituents in leaf are affected by 

15 senescence that stimulates the maturation of leaf quality 
properties. Still other reports have shown that P450s enzymes 
are play a functional role in altering fatty acids that are 
involved in plant-pathogen interactions and disease resistance. 

20 In other instances, P450 enzymes have been suggested to be 

involved in alkaloid biosynthesis. Nornicotine is a minor 
alkaloid found in Nicotiana tabaccum. It is has been 

postulated that it is produced by the P450 mediated 
demethylation of nicotine followed by acylation and nitrosation 

25 at the N position thereby producing a series of N- 
acylnonicotines and N-nitrosonornicotines . N- demethylation , 
catalyzed by a putative P450 demethylase, is thought to be a 
primary source of nornicotine biosyntheses in Nicotiana. While 
the enzyme is believed to be microsomal, thus far a nicotine 

30 demethylase enzyme has not been successfully purified, nor have 
the genes involved been isolated. 

Furthermore, it is hypothesized but not proven that the 
activity of P450 enzymes is genetically controlled and also 
35 strongly influenced by * environment factors. For example, the 

2 
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demethylation of nicotine in Nicotiana is thought to increase 
substantially when the plants reach a mature stage. 
Furthermore, it is thought that the demethylase gene contains a 
transposable element that can inhibit translation of RNA when 
5 present. 

The large multiplicity of P450 enzyme forms, their 
differing structure and function have made their research on 
Nicotiana P450 enzymes very difficult before the enclosed 

10 invention. In addition, cloning of P450 enzymes has been 
hampered at least in part because these membrane-localized 
proteins are typically present in low abundance and often 
unstable to purification. Hence, a need exists for the 
identification of P450 enzymes in plants and the nucleic acid 

15 sequences associated with those P450 enzymes. In particular, 
only a few cytochrome Nicotiana P450 proteins have been 
reported in tobacco. The inventions described herein entail 
the discovery of a substantial number of cytochrome P450 
fragments that correspond to several groups of P450 species 

20 based on their sequence identity. 

SUMMARY 

The present invention is directed to plant P450 enzymes. 
The present invention is further directed to plant P450 enzymes 

25 from Nicotiana. The present invention is also directed to P450 
enzymes in plants whose expression is induced by ethylene 
and/or plant senescence. The present invention is yet further 
directed to nucleic acid sequences in plants having enzymatic 
activities, for example, oxygenase, demethylase and the like, 

30 or other and the use of those sequences to reduce or silence 
the expression of these enzymes. The invention also relates to 
P450 enzymes found in plants containing higher nornicotine 
levels than plants exhibiting lower nornicotine levels. 



3 
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In one aspect, the invention is directed to nucleic acid 
sequences as set forth in SEQ. ID. Nos . 1, 3, 5, 7, 9, 11, 13, 
15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 
47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 
79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 
109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 
133, 135, 137, 139, 141, 143, 145 and 147. 

In a second related aspect, those fragments containing 
greater than 75% identity in nucleic acid sequence were placed 
into groups dependent upon their identity in a region 
corresponding to the first nucleic acid following the 
cytochrome P450 motif GXRXCX(G/A) to the ' stop codon. The 
representative nucleic acid groups and respective species are 
shown in Table I . 

In a third aspect, the invention is directed to amino acid 
sequences as set forth in SEQ. ID. Nos. 2, 4, 6, 8, 10, 12, 14, 
16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 
48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 
80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 
108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 
132, 134, 136, 138, 140, 142, 144, 146 and 148. 

In a fourth related aspect, those fragments containing 
greater than 71% identity in amino acid sequence were placed 
into groups dependent upon their identity to each other in a 
region corresponding to the first amino acid following the 
cytochrome P450 motif GXRXCX(G/A) to the stop codon. The 
representative amino acid groups and respective species are 
shown in Table II. 

In a fifth aspect of the invention is the use of nucleic 
acids sequences as set forth in SEQ. ID. Nos. 1, 3, 5, 7, 9, 
11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 

4 
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43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 
75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 
105, 107, 109, " 111, 113, 115, 117, 119, 121, 123, 125, 127, 
129, 131, 133, 135, 137, 139, 141, 143, 145 and 147. 

5 

In a sixth related aspect, the reduction or elimination of 
P450 enzymes in Nicotiana plants may be accomplished 
transiently using RNA viral systems. Resulting transformed 
or infected plants are assessed for phenotypic changes 
10 including, but not limited to, analysis of endogenous P450 RNA 
transcripts, P450 expressed peptides, and concentrations of 
plant metabolites using technigues commonly available to one 
having ordinary skill in the art. 

15 in a seventh important aspect, the present invention is 

also directed to generation of trangenic Nicotiana lines that 
have altered P450 enzyme activity levels. In accordance with 
the invention, these transgenic lines include nucleic acid 
sequences that are effective for reducing or silencing the 

20 expression of certain enzyme thus resulting in phenotypic 
effects within Nicotiana. Such nucleic acid sequences include 
SEQ. ID. Nos. 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 
27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49,. 51, 53, 55, 57, 
59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 

25 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 
117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 
141, 143, 145 and 147. 

In a very important eight aspect of the invention, plant 
30 cultivars including nucleic acids of the present invention in a 
down regulation capacity will have altered metabolite profiles 
relative to control plants. 

In a ninth aspect, the present invention is directed to 
35 the screening of plants, more preferably Nicotiana, that 

5 
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contain genes that have substantial nucleic acid identity to 
the taught nucleic acid sequence. The use of the invention 
would be advantageous to identify and select plants that 
contain a nucleic acid sequence with exact or substantial 
5 identity where such plants are part of a breeding program for 
traditional or transgenic varieties, a mutagenesis program, or 
naturally occurring diverse plant populations. The screening 
of plants for substantial nucleic acid identity may be 
accomplished by evaluating plant nucleic acid materials using a 
10 nucleic acid probe in conjunction with nucleic acid detection 
protocols including, but not limited to, nucleic acid 
hybridization and PCR analysis. The nucleic acid probe may 
consist of the taught nucleic acid sequence or fragment thereof 
corresponding to SEQ ID 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 
15 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 
55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 
87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 
115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 
139, 141, 143, 145 and 147. 

20 

In a tenth aspect, the present invention is directed to 
the identification of plant genes, more preferably Nicotiana, 
that share substantial amino acid identity corresponding to the 
taught nucleic acid sequence. The identification of plant 

25 genes including both cDNA and genomic clans of those cDNAs and 
genomic clones, preferably from Nicotiana may be accomplished 
by screening plant cDNA libraries using a nucleic acid probe in 
conjunction with nucleic acid detection protocols including, 
but not limited to, nucleic acid hybridization and PCR 

30 analysis. The nucleic acid probe may be comprised of nucleic 
acid sequence or fragment thereof corresponding to SEQ ID 1, 3, 
5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 
37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 
69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 
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101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 
125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145 and 147 

In an alterative eleventh aspect, cDNA expression 
5 libraries that express peptides may be screened using 
antibodies directed to part or all of the taught amino acid 
sequence. Such amino acid sequences include SEQ ID 2, 4, 8, 9, 
10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 
42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 
10 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 
104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 
128, 130, 132, 134, 136, 138, 140, 142, 144, 146 and 148. 

BRIEF DESCRIPTION OF DRAWINGS 

15 Figure 1 shows nucleic acid SEQ. ID. No . : 1 and amino acid 

SEQ. ID. No . : 2 . 

Figure 2 shows nucleic acid SEQ. ID. No . : 3 and amino acid 

SEQ. ID. No. : 4 . 

Figure 3 shows nucleic acid SEQ. ID. No . : 5 and amino acid 

20 SEQ. ID. No. : 6. 

Figure 4 shows nucleic acid SEQ. ID. No . : 7 and amino acid 

SEQ. ID. No.:8. 

Figure 5 shows nucleic acid SEQ. ID. No . : 9 and amino acid 

SEQ. ID. No . : 10 . 

25 Figure 6 shows nucleic acid SEQ. ID. No.: 11 and amino acid 

SEQ. ID. No . : 12 . 

Figure 7 shows nucleic acid SEQ. ID. No.: 13 and amino acid 

SEQ. ID. No.: 14. 

Figure 8 shows nucleic acid SEQ. ID. No.: 15 and amino acid 

30 SEQ. ID. No. : 16. 

Figure' 9 shows nucleic acid SEQ. ID. No.:17 and amino acid 
§EQ. ID. No . : 18 . 

Figure 10 shows nucleic acid SEQ. ID. No.: 19 and amino 
acid SEQ. ID. No. :20. 
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Figure 11 shows 
acid SEQ. ID. No. :22. 

Figure 12 shows 
acid SEQ. ID. No. :24. 

Figure 13 shows 
acid SEQ. ID. No. :26. 

Figure 14 shows 
acid SEQ. ID. No. :28. 

Figure 15 shows 
acid SEQ. ID. No.:30. 

Figure 16 shows 
acid SEQ. ID. No.:32. 

Fi gur e 1 7 shows 
acid SEQ. ID. No.: 34. 

Figure 18 shows 
acid SEQ. ID. No. :36. 

Figure 19 shows 
acid SEQ. ID. No. :38. 

Figure 20 shows 
acid SEQ. ID. No. :40. 

Figure 21 shows 
acid SEQ. ID. No. :42. 

Figure 22 shows 
acid SEQ. ID. No. :44„ 
Figure 23 shows 
acid SEQ. ID. No. :46. 

Figure 24 shows 
acid SEQ. ID. No. : 48 . 

Figure 25 shows 
acid SEQ. ID. No.:50. 

Figure 2 6 shows 
acid SEQ. ID. No. : 52 . 

Figure 27 shows 
acid SEQ. ID. No.:54. 



nucleic acid SEQ. 
nucleic acid SEQ. 
nucleic acid SEQ. 
nucleic acid SEQ. 
nucleic acid SEQ. 
nucleic acid SEQ. 

nucleic acid SEQ. 

nucleic acid SEQ. 

nucleic acid SEQ. 

nucleic acid SEQ. 

nucleic acid SEQ. 

nucleic acid SEQ.^ 

nucleic acid SEQ. 

nucleic acid SEQ. 

nucleic acid SEQ. 

nucleic acid SEQ. 

nucleic acid SEQ. 
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ID. No.: 21 and amino 

ID. No.: 23 and amino 

ID. No.: 25 and amino 

ID. No.:27 and amino 

ID . No . : 2 9 and amino 

ID. No.: 31 and amino 

ID. No.: 33 and amino 

ID. No.:35 and amino 

ID . No . : 3 7 and amino 

ID. No.: 39 and amino 

ID . No . : 4 1 and amino 

ID . No . : 43 and amino 

ID . No . : 45 and amino 

ID. No.:47 and amino 

ID. No.: 49 and amino 

ID . No . : 51 and amino 

ID . No . : 53 and amino 
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Figure 2 8 shows 
acid SEQ. ID. No. :56. 

Figure 2 9 shows 
acid SEQ . ID. No. : 58 . 

Figure 30 shows 
acid SEQ. ID. No. : 60. 

Figure 31 shows 
acid SEQ. ID. No. : 62. 

Figure 32 shows 
acid SEQ. ID. No. : 64. 

Figure 33 shows 
acid SEQ. ID. No.: 66. 

Figure 3 4 shows 
acid SEQ. ID. No.: 68. 

Figure 35 shows 
acid SEQ. ID. No. :70. 

Figure 3 6 shows 
acid SEQ. ID. No. :72. 

Figure 37 shows 
acid SEQ. ID. No. :74. 

Figure 38 shows 
acid SEQ. ID. No. : 76. 

Figure 39 shows 
acid SEQ. ID. No. :78. 

Figure 40 shows 
acid SEQ. ID. No. :80. 

Figure 41 shows 
acid SEQ. ID. No. : 82. 

Figure 42 shows 
acid SEQ. ID. No. : 84. 

Figure 43 shows 
acid SEQ. ID. No. : 86. 

Figure 44 shows 
acid SEQ. ID. No. : 88 . 



nucleic acid SEQ. 
nucleic acid SEQ. 
nucleic acid SEQ. 
nucleic acid SEQ. 
nucleic acid SEQ. 
nucleic acid SEQ. 
nucleic acid SEQ. 

nucleic acid SEQ. 

nucleic acid SEQ. 

nucleic acid SEQ. 

nucleic acid SEQ. 

nucleic acid SEQ. 

nucleic acid SEQ. 

nucleic acid SEQ. 

nucleic acid SEQ. 

nucleic acid SEQ. 

nucleic acid SEQ. 
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ID . No . : 55 and amino 

ID. No.: 57 and amino 

ID. No.: 59 and amino 

ID. No.: 61 and amino 

ID. No.: 63 and amino 

ID . No . : 65 and amino 

ID . No . : 67 and amino 

ID. No.: 69 and amino 

ID. No.: 71 and amino 

ID. No.: 73 and amino 

ID . No . : 7 5 and amino 

ID. No.: 77 and amino 

ID. No.: 79 and amino 

ID. No.: 81 and amino 

ID. No.: 83 and amino 

ID . No . : 8 5 and amino 

ID. No.: 87 and amino 
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Figure 4 5 shows nucleic 
acid SEQ. ID. No . : 90 . 

Figure 4 6 shows nucleic 
acid SEQ. ID. No . : 92 * 

Figure 47 shows nucleic 
acid SEQ. ID. No. :94. 

Figure 4 8 shows nucleic 
acid SEQ. ID. No. :96. 

Figure 49 shows nucleic 
acid SEQ. ID. No. :98. 

Figure 50 shows nucleic 
acid SEQ. ID. No. :100. 

Figure 51 shows nucleic 
acid SEQ. ID. No. :102. 

Figure 52 shows nucleic 
acid SEQ. ID. No. :104. 

Figure 53 shows nucleic 
acid SEQ. ID. No.: 106. 

Figure 54 shows nucleic 
acid SEQ. ID. No. : 108. 

Figure 55 shows nucleic 
acid SEQ. ID. No.: 110. 

Figure 56 shows nucleic 
acid SEQ. ID. No. :112. 

Figure 57 shows nucleic 
acid SEQ. ID. No.:114. 

Figure 58 shows nucleic 
acid SEQ. ID. No.:116. 

Figure 59 shows nucleic 
acid SEQ. ID. No.: 118. 

Figure 60 shows nucleic 
acid SEQ. ID. No.:120. 

Figure 61 shows nucleic 
acid SEQ. ID. No. :122. 
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acid SEQ. ID. No.: 89 and amino 

acid SEQ. ID. No.: 91 and amino 

acid SEQ. ID. No.:93 and amino 

acid SEQ. ID. No.: 95 and amino 

acid SEQ. ID. No.: 97 and amino 

acid SEQ. ID. No.:99 and amino 

acid SEQ. ID. No.:101 and amino 

acid SEQ. ID. No.: 103 and amino 

acid SEQ. ID. No.:105 and amino 

acid SEQ. ID. No.:107 and amino 

acid SEQ. ID. No.: 109 and amino 

acid SEQ. ID. No.: Ill and amino 

acid SEQ. ID. No.: 113 and amino 

acid SEQ. ID. No.:115 and amino 

acid SEQ. ID. No.:117 and amino 

acid SEQ. ID. No.: 119 and amino 

acid SEQ. ID. No.: 121 and amino 
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Figure 62 shows nucleic acid SEQ. ID. No„:123 and amino 
acid SEQ . ID. No.: 124. 

Figure 63 shows nucleic acid SEQ. ID. No-: 125 and amino 
acid SEQ. ID. No. :126. 
5 Figure 64 shows nucleic acid SEQ. ID. No.:127 and amino 

acid SEQ. ID. No.: 128. 

Figure 65 shows nucleic acid SEQ. ID. No-: 129 and amino 
acid SEQ. ID. No. :130. 

Figure 66 shows nucleic acid SEQ. ID. No.: 131 and amino 
10 acid SEQ. ID. No.:132. 

Figure 67 shows nucleic acid SEQ. ID. No.:133 and amino 
acid SEQ. ID. No.:134. 

Figure 68 shows nucleic acid SEQ. ID. No.:135 and amino 
acid SEQ. ID. No.:136. 
15 Figure 69 shows nucleic acid SEQ. ID. No.: 137 and amino 

acid SEQ. ID. No.:138. 

Figure 70 shows nucleic acid SEQ. ID. No.:139 and amino 
acid SEQ. ID. No.:140. 

Figure 71 shows nucleic acid SEQ. ID. No.: 141 and amino 
20 acid SEQ. ID. No.:142. 

Figure 72 shows nucleic acid SEQ. ID. No.:143 and amino 
acid SEQ. ID. No.: 144. 

Figure 73 shows nucleic acid SEQ. ID. No.:145 and amino 
acid SEQ. ID. No.: 146. 
25 Figure 74 shows nucleic acid SEQ. ID. No.: 147 and amino 

acid SEQ. ID. No.:148. 

Figure 75 shows a procedure used for cloning of cytochrome 
P450 cDNA fragments by PGR. SEQ . ID. Nos. 149-156 are shown. 

Figure 7 6 illustrates amino acid identity of group 
30 members . 

DETAILED DESCRIPTION 

DEFINITIONS 

Unless defined otherwise, all technical and scientific 
35 terms used herein have the same meaning as commonly understood 

11 
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by one of ordinary skill in the art to which this invention 
belongs. Singleton et al. (1994) Dictionary of Microbiology and 
Molecular Biology, second edition, John Wiley and Sons (New 
York) provides one of skill with a general dictionary of many 
5 of the terms used in this invention. All patents and 
publications referred to herein are incorporated by reference 
herein. For purposes of the present invention, the following 
terms are defined below. 

10 "Enzymatic activity" is meant to include demethylation, 

hydroxylation, epoxidation, N-oxidation, sulf ooxidation, N-, S- 
, and 0- dealkylations, desulfation, deamination, and reduction 
of azo, nitro, and N-oxide groups. The term "nucleic acid" 
refers to a deoxyribonucleotide or ribonucleotide polymer in 

15 either single- or double-stranded form, or sense or anti-sense, 
and unless otherwise limited, encompasses known analogues of 
natural nucleotides that hybridize to nucleic acids in a manner 
similar to naturally occurring nucleotides. Unless otherwise 
indicated, a particular nucleic acid sequence includes the 

20 complementary sequence thereof. The terms "operably linked", 
"in operable combination", and "in operable order" refer to 
functional linkage between a nucleic acid expression control 
sequence (such as a promoter, signal sequence, or array of 
transcription factor binding sites) and a second nucleic acid 

25 sequence, wherein the expression control sequence affects 
transcription and/or translation of the nucleic acid 
corresponding to the second sequence. 

The term "recombinant" when used with reference to a cell 
30 indicates that the cell replicates a heterologous nucleic acid, 
expresses said nucleic acid or expresses a peptide, 
heterologous peptide, or protein encoded by a heterologous 
nucleic acid. Recombinant cells can express genes or gene 
fragments in either the sense or antisense form that are not 
35 found within the native (non- recombinant) form, of the cell. 

12 
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Recombinant cells can also express genes that are found in the 
native form of the cell, but wherein the genes are modified and 
re- introduced into the cell by artificial means. 

5 A "structural gene 7 ' is that portion of a gene comprising a 

DNA segment encoding a protein, polypeptide or a portion 
thereof, and excluding the 5 1 sequence which drives the 
initiation of transcription. The structural gene may 

alternatively encode a nontranslatable product. The structural 

10 gene may be one which is normally found in the cell or one 
which is not normally found in the cell or cellular location 
wherein it is introduced, in which case it is termed a 
"heterologous gene". A heterologous gene may be derived in 
whole or in part from any source known to the art, including a 

15 bacterial genome or episome, eukaryotic, nuclear or plasmid 
DNA, cDNA, viral DNA or chemically synthesized DNA. A 
structural gene may contain one or more modifications that 
could effect biological activity or its characteristics, the 
biological activity or the chemical structure of the expression 

20 product, the rate of expression or the manner of expression 
control. Such modifications include, but are not limited to, 
mutations, insertions, deletions and substitutions of one or 
more nucleotides. The structural gene may constitute an 
uninterrupted coding sequence or it may include one or more 

25 introns, bounded by the appropriate splice junctions. The 
structural gene may be translatable or non-translatable, 
including in an anti-sense orientation. The structural gene 
may be a composite of segments derived from a plurality of 
sources and from a plurality of gene sequences (naturally 

30 occurring or synthetic, where synthetic refers to DNA that is 
chemically synthesized) . 

"Derived from" is used to mean taken, obtained, received, 
traced, replicated or descended from a source (chemical and/or 
35 biological) . A derivative may be produced by chemical or 

13 
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biological manipulation (including, but not limited to, 
substitution, addition, insertion, deletion, extraction, 
isolation, mutation and replication) of the original source. 

5 "Chemically synthesized", as related to a sequence of DNA, 

means that portions of the component nucleotides were assembled 
in vitro* Manual chemical synthesis of DNA may be accomplished 
using well established procedures (Caruthers, Methodology of 
DNA and RNA Sequencing , (1983), Weissman (ed.), Praeger 
10 Publishers, New York, Chapter 1) ; automated chemical synthesis 
can be performed using one of a number of commercially 
available machines . 

Optimal alignment of sequences for comparison may be 
15 conducted by the local homology algorithm of Smith and 
Waterman, Adv. Appl . Math. 2:482 (1981), by the homology 
alignment algorithm of Needleman and Wunsch, J. Mol . Biol. 
48:443 (1970), by the search for similarity method of Pearson 
and Lipman Proc. Natl. Acad. Sci. (U.S.A.) 85: 2444 (1988), by 
20 computerized implementations of these algorithms (GAP, BESTFIT, 
FASTA, and T FAST A in the Wisconsin Genetics Software Package, 
Genetics Computer Group, 575 Science Dr., Madison, Wis.), or by 
inspection. 

i 

25 The NCBI Basic Local Alignment Search Tool (BLAST) 

(Altschul et al., 1990) is available from several sources, 
including the National Center for Biological Information (NCBI, 
Bethesda, Md. ) and on the Internet, for use in connection with 
the sequence analysis programs blastp, blastn, blastx, tblastn 

30 and tblastx. It can be accessed at 

htp: //www.ncbi .nlm.nih. gov/BLAST/. A description of how to 
determine sequence identity using this program is available at 
http : //www . ncbi . nlm . nih . gov/BLAST/blas t help . html . 
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The terras "substantial amino acid identity" or 
"substantial amino acid sequence identity" as applied to amino 
acid sequences and as used herein denote a characteristic of a 
polypeptide, wherein the peptide comprises a sequence that has 
5 at least 70 percent sequence identity, preferably 80 percent 
amino acid sequence identity, more preferably 90 percent amino 
acid sequence identity, and most preferably at least 99 to 100 
percent sequence identity as compared to a reference group over 
region corresponding to the first amino acid following the 
10 cytochrome P450 motif GXRXCX(G/A) to the stop codon of the 
translated peptide* 

The terms "substantial nucleic acid identity" or 
"substantial nucleic acid sequence identity" as applied to 

15 nucleic acid sequences and as used herein denote a 
characteristic of a polynucleotide sequence, wherein the 
polynucleotide comprises a sequence that has at least 75 
percent sequence identity, preferably 81 percent amino acid 
sequence identity, more preferably at least 91 to 99 percent 

20 sequence identity, and most preferably at least 99 to 100 
percent sequence identity as compared to a reference group over 
region corresponding to the first nucleic acid following the 
cytochrome P450 motif GXRXCX (G/A) to the stop codon of the 
translated peptide. 

25 

Another indication that nucleotide sequences are 
substantially identical is if two molecules hybridize to each 
other under stringent conditions. Stringent conditions are 
sequence-dependent and will be different in different 

30 circumstances. Generally, stringent conditions are selected to 
be about 5°C to about 20°C, usually about 10°C to about 15°C, 
lower than the thermal melting point (Tm) for the specific 
sequence at a defined ionic strength and pH. The Tm is the 
temperature (under defined ionic strength and pH) at which 50% 

35 of the target sequence hybridizes to a matched probe. 
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Typically/ stringent conditions will be those in which the salt 
concentration is about 0.02 molar at pH 7 and the temperature 
is at least about 60 °C. For instance ( in a standard Southern 
hybridization procedure, stringent conditions will include an 
5 initial wash in 6xSSC at 42 °C followed by one or more 
additional washes in 0.2xSSC at a temperature of at least about 
55°C, typically about 60°C and often about 65°C. 

Nucleotide sequences are also substantially identical for 
10 purposes of this invention when the polypeptides and/or 
proteins which they encode are substantially identical. Thus, 
where one nucleic acid sequence encodes essentially the same 
polypeptide as a second nucleic acid sequence, the two nucleic 
acid sequences are substantially identical, even if they would 
15 not hybridize under stringent conditions due to degeneracy 
permitted by the genetic code (see, Darnell et al . (1990) 
Molecular Cell Biology, Second Edition Scientific American 
Books W„ H. Freeman and Company New York for an explanation of 
codon degeneracy and the genetic code) . Protein purity or 

20 homogeneity can be indicated by a number of means well known in 
the art, such as polyacrylamide gel electrophoresis of a 
protein sample, followed by visualization upon staining. For 
certain purposes high resolution may be needed and HPLC or a 
similar means for purification may be utilized. 

25 

As used herein, the term "vector" is used in reference to 
nucleic acid molecules that transfer DNA segment (s) into a 
cell. A vector may act to replicate DNA and may reproduce 
independently in a host cell. The term "vehicle" is sometimes 

30 used interchangeably with "vector." The term "expression 
vector" as used herein refers to a recombinant DNA molecule 
containing a desired coding sequence and appropriate nucleic 
acid sequences necessary for the expression of the operably 
linked coding sequence in a particular host organism. Nucleic 

35 acid sequences necessary for expression in prokaryotes usually 
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include a promoter, an operator (optional) , and a ribosome 
binding site, often along with other sequences. Eucaryotic 
cells are known to utilize promoters, enhancers, and 
termination and polyadenylation signals. 

5 

For the purpose of regenerating complete genetically 
engineered plants with roots, a nucleic acid may be inserted 
into plant cells, for example, by any technique such as in vivo 
inoculation or by any of the known in vitro tissue culture 
10 techniques to produce transformed plant cells that can be 
regenerated into complete plants. Thus, for example, the 
insertion into plant cells may be by in vitro inoculation by 
pathogenic or non-pathogenic A. tumefaciens . Other such tissue 
culture techniques may also be employed. 

15 

"Plant tissue" includes differentiated and 

undifferentiated tissues of plants, including, but not limited 
to, roots, shoots, leaves, pollen, seeds, tumor tissue and 
various forms of cells in culture, such as single cells, 
20 protoplasts, embryos and callus tissue. The plant tissue may be 
in planta or in organ, tissue or cell culture. 

"Plant cell" as used herein includes plant cells In planta 
and plant cells and protoplasts in culture. "cDNA" or 
25 "complementary DNA" generally refers to ■ a single stranded DNA 
molecule with a nucleotide sequence that is complementary to an 
RNA molecule. cDNA is formed by the action of the enzyme 
reverse transcriptase on an RNA template. 

30 STRATEGIES FOR OBTAINING NUCLEIC ACID SEQUENCES 

In accordance with the present invention, RNA was 
extracted from Nicotiana tissue of converter and non-converter 
Nicotiana lines. The extracted RNA was then used to create 
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cDNA. Nucleic acid sequences of the present invention were 
then generated using two strategies. 

In the first strategy, the poly A enriched RNA was 
5 extracted from plant tissue and cDNA was made by reverse 
transcription PCR. The single strand cDNA was then used to 
create P450 specific PCR populations using degenerate primers 
plus a oligo d(T) reverse primer. The primer design was based 
on the highly conserved motifs of P450, Examples of specific 
10 degenerate primers are set forth in Figure 1. Sequence 
fragments from plasmids containing appropriate size inserts 
were further analyzed. . These size inserts typically ranged 
from about 300 to about 800 nucleotides depending on which 
primers were used. 

15 

In a second strategy, a cDNA library was initially 
constructed. The cDNA in the plasmids was used to create p450 
specific PCR populations using degenerate primers plus T7 
primer on plasmid as reverse primer. As in the first strategy F 
20 sequence fragments from plasmids containing appropriate size 
inserts were further analyzed. 

Nicotiana plant lines known to produce high levels of 
nornicotine (converter) and plant lines having undetectable 
25 levels of nornicotine may be used as starting materials. 

Leaves can then be removed from plants and treated with 
ethylene to activate P450 enzymatic activities defined herein. 
Total RNA is extracted using techniques known in the art. cDNA 
30 fragments can then be generated using PCR (RT-PCR) with the 
oligo d(T) primer as described in Figure 1. The cDNA library 
can then be constructed more fully described in examples 
herein. 

35 
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The conserved region of P450 type enzymes can be used as a 
template for degenerate primers (Figure 75) . Using degenerate 
primers, P450 specific bands can be amplified by PCR. Bands 
indicative for P450 like enzymes can be identified by DNA 
5 sequencing* PCR fragments can be characterized using BLAST 
search, alignment or other tools to identify appropriate 
candidates . 

Sequence information from identified fragments can be used 
10 to develop PCR primers. These primers are used to conduct 
quantitative RT-PCR from the RNA' s of converter and non- 
converter ethylene treated plant tissue. Only appropriate 
sized DNA bands (300-8 00 bp) from converter lines or bands with 
higher density denoting higher expression in converter lines 
15 were used for further characterization. Large scale Southern 
reverse analysis were conducted to examine the differential 
expression for all clones obtained. In this aspect of the 
invention, these large scale reverse Southern assays can be 
conducted using labeled total cDNA' s from different tissues as 
20 a probe to hybridize with cloned DNA fragments in order to 
screen all cloned inserts. 

Nonradioactive Northern blotting assay was also used to 
characterize clones P450 fragments. 

25 

Nucleic acid sequences identified as described above can 
be examined by using virus induced gene silencing technology 
(VIGS, Baulcombe, Current Opinions in Plant Biology, 1999, 
2:109-113) . 

30 

In another aspect of the invention, interfering RNA 
technology (RNAi) is used to further characterize cytochrome 
P450 enzymatic activities in Nicotiana plants of the present 
invention. The following references which describe this 
35 technology are incorporated by reference herein, Smith et al . , 
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Nature, 2000, 407:319-320; Fire et al . , Nature, 1998, 391:306- 
311; Waterhouse et al . , PNAS, 1998, 95:13959-13964; Stalberg et 
al., Plant Molecular Biology, 1993, 23:671- 683; Baulcombe, 
Current Opinions in Plant Biology, 1999, 2:109-113; and 
5 Brigneti et al . , EMBO Journal, 1998, 17 (22) : 6739-6746. Plants 
may be transformed using RNAi techniques, antisense techniques, 
or a variety of other methods described. 

Several techniques exist for introducing foreign genetic 

10 material into plant cells, and for obtaining plants that stably 
maintain and express the introduced gene. Such techniques 
include acceleration of genetic material coated onto 
microparticles directly into cells (US Patents 4,945,050 to 
Cornell and 5,141,131 to DowElanco) . Plants may be transformed 

15 using Agrobacterium technology, see US Patent 5,177,010 to 
University of Toledo, 5,104,310 to Texas A&M, European Patent 
Application 0131624B1, European Patent Applications 120516, 
159418B1, European Patent Applications 120516, 159418B1 and 
176,112 to Schilperoot, US Patents 5,149,645, 5,469,976, 

20 5,464,763 and 4,940,838 and 4,693,976 to Schilperoot, European 
Patent Applications 116718, 290799, 320500 all to MaxPlanck, 
European Patent Applications 604 662 and 627752 to Japan 
Nicotiana, European Patent Applications 02 67159, and 0292435 
and US Patent 5,231,019 all to Ciba Geigy, US Patents 5,463,174 

25 and 4,762,785 both to Calgene, and US Patents 5, 004, 863 and 
5,159,135 both to Agracetus. Other transformation technology 
includes whiskers technology, see U.S. Patents 5,302,523 and 
5,464,765 both to Zeneca. Electroporation technology has also 
been used to transform plants, see WO 87/06614 to Boyce 

30 Thompson Institute, 5,472,869 and 5,384,253 both to Dekalb, 
WO9209696 and W09321335 both to PGS . All of these 

transformation patents and publications are incorporated by 
reference. In addition to numerous technologies for 

transforming plants, the type of tissue which is contacted with 

35 the foreign genes may vary as well. Such tissue would include 
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but would not be limited to embryogenic tissue, callus tissue 
type I and II, hypocotyl, meristem, and the like. Almost all 
plant tissues may be transformed during dedif f erentiation using 
appropriate techniques within the skill of an artisan „ 

5 

Foreign genetic material introduced into a plant may 
include a selectable marker „ The preference for a particular 
marker is at the discretion of the artisan, but any of the 
following selectable markers may be used along with any other 

10 gene not listed herein which could function as a selectable 
marker. Such selectable markers include but are not limited to 
aminoglycoside phosphotransferase gene of transposon Tn5 (Aph 
II) which encodes resistance to the antibiotics kanamycin, 
neomycin and G418, as well as those genes which code for 

15 resistance or tolerance to glyphosate; hygromycin; 
methotrexate; phosphinothricin (bar) ; imidazolinones, 

sulfonylureas and triazolopyrimidine herbicides, such as 
chlorosulfuron; bromoxynil, dalapon and the like. 

20 In addition to a selectable marker, it may be desirous to 

use a reporter gene. In some instances a reporter gene may be 
used without a selectable marker. Reporter genes are genes 
which are typically not present or expressed in the recipient 
organism or tissue. The reporter gene typically encodes for a 

25 protein which provide for some phenotypic change or enzymatic 
property. Examples of such genes are provided in K. Weising et 
al. Ann. Rev. Genetics, 22, 421 (1988), which is incorporated 
herein by reference. Preferred reporter genes include without 
limitation glucuronidase (GUS) gene and GFP genes. 

30 

Once introduced into the plant tissue, the expression of 
the structural gene may be assayed by any means known to the 
art, and expression may be measured as mRNA transcribed, 
protein synthesized, or the amount of gene silencing that 
35 occurs (see U.S. Patent No. 5,583,021 which is hereby 
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incorporated by reference) ♦ Techniques are known for the in 
vitro culture of plant tissue, and in a number of cases, for 
regeneration into whole plants (EP Appln No, 88810309.0). 
Procedures for transferring the introduced expression complex 
5 to commercially useful cultivars are known to those skilled in 
the art . 

Once plant cells expressing the desired level of P450 
enzyme are obtained, plant tissues and whole plants can be 
10 regenerated therefrom using methods and techniques well-known 
in the art. The regenerated plants are then reproduced by 
conventional means and the introduced genes can be transferred 
to other strains and cultivars by conventional plant breeding 
techniques • 

15 

The following examples illustrate methods for carrying out 
the invention and should be understood to be illustrative of, 
but not limiting upon, the scope of the invention which is 
defined in the appended claims. 

20 

EXAMPLES 

EXAMPLE I: DEVELOPMENT OF PLANT TISSUE AND ETHYLENE TREATMENT 

25 Plant Growth 

Plants were seeded in pots and grown in a greenhouse for 4 
weeks* The 4 week old seedlings were transplanted into 
individual pots and grown in the greenhouse for 2 months* The 
30 plants were watered 2 times a day with water containing 150ppm 
NPK fertilizer during growth. The expanded green leaves were 
detached from plants to do the ethylene treatment described 
below. 
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Cell Line 78379 

Tobacco line 78379, which is a burley line released by the 
University of Kentucky was used as a source of plant material . 
5 One hundred plants were cultured as standard in the art of 
growing tobacco and transplanted and tagged with a distinctive 
number (1-100) . Fertilization and field management were 
conducted as recommended, 

10 Three quarters of the 100 plants converted between 20 and 

100% of the nicotine to nornicotine. One quarter of the 100 
plants converted less than 5% of the nicotine to nornicotine,, 
Plant number 87 had the least conversion (2%) while plant 
number 21 had 100% conversion. Plants converting less than 3% 

15 were classified as non-converters. Self -pollinated sefed of 
plant number 87 and plant number 21, as well as crossed (21 x 
87 and 87 x 21) seeds were made to study genetic and phenotype 
differences. Plants from selfed 21 were converters, and 99% of 
selfs from 87 were non-converters. The other 1% of the plants 

20 from 87 showed low conversion (5-15%) , Plants from reciprocal 
crosses were all converters . 

Cell Line 4407 

25 Nicotiana line 4407, which is a burley line was used as a 

source of plant material. Uniform and representative plants 
(100) were selected and tagged. Of the 100 plants 97 were non- 
converters and three were converters. Plant number 56 had the 
least amount of conversion (1.2%) and plant number 58 had the 

30 highest level of conversion (96%) . Self-pollenated seeds and 
crossed seeds were made with these two plants. 

Plants from selfed-58 were segregating in about a 3:1 
converter to non-converter ratio. The 58-33 and 58-25 were 
35 identified as homozygous converter and nonconverter plant 
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lines, respectively. The stable conversion of 58-33 was 
confirmed by analysis of its progenies of next generation. 

Ethylene Treatment Procedures 

5 

Green leaves were detached from 2-3 month greenhouse grown 
plants and sprayed with 0.3% ethylene solution (Prep brand 
Ethephon (Rhone-Poulenc) ) . Each sprayed leaf was hung in a 
curing rack equipped with humidifier and covered with plastic. 

10 During the treatment, the sample leaves were periodically 
sprayed with the ethylene solution. Approximately 24-48 hour 
post ethylene treatment, leaves were collected for RNA 
extraction. Another sub-sample was taken for metabolic 
consituents analysis to determine the concentration of leaf 

15 metabolites and more specific constituents of interest such as 
a variety of alkaloids. 

As an example, alkaloids analysis could be performed as 
follows. Samples (0.1 g) were shaken at 150 rpm with 0 . 5 ml 2N 

20 NaOH, and a 5 ml extraction solution which contained quinoline 
as an internal standard and methyl t-butyl ether. Samples were 
analyzed on a HP 6890 GC equipped with a FID detector. A 
temperature of 250 °C was used for the detector and injector. 
An HP column (30m-0 . 32nm-l -m) consisting of fused silica 

25 crosslinked with 5% phenol and 95% methyl silicon was used at a 
temperature gradient of 110-185 °C at 10°C per minute. The 
column was operated at a flow rate at 100 °C at 1.7cm 3 min _1 with 
a split ratio of 40:1 with a 2-1 injection volume using helium 
as the carrier gas. 

30 

EXAMPLE 2: RNA ISOLATION 

For RNA extractions, middle leaves from 2 month old 
greenhouse grown plants were treated with ethylene as 
35 described. The 0 and 24-48 hours samples were used for RNA 
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extraction. In some cases, leaf samples under the senescence 
process were taken from the plants 10 days post flower-head 
removal. These samples were also used for extraction. Total RNA 
was isolated using Rneasy Plant Mini Kit (Qiagen, Inc., 
5 Valencia, California) following manufacturer' s protocol. 

The tissue sample was grinded under liquid nitrogen to a 
fine powder using a DEPC treated mortar and pestle. 
Approximately 100 mg of ground tissue was transferred to a 

10 sterile 1.5 ml eppendorf tube. This sample tube was placed in 
liquid nitrogen until all samples were collected. Then, 450jal 
of Buffer RLT as provided in the kit (with the addition of (3- 
Mercaptoethanol ) was added to each individual tube. The sample 
was vortexed vigorously and incubated at 5 6° C for 3 minutes. 

15 The lysate was then, applied to the QIAshredder spin column 
sitting in a 2 -ml collection tube, and centrifuged for 2 
minutes at maximum speed. The flow through was collected and 
0.5 volume of ethanol was added to the cleared lysate. The 
sample is mixed well and transferred to an Rneasy mini spin 

20 column sitting in a 2 ml collection tube. The sample was 
centrifuged for 1 minute at 10, OOOrpm. Next, 700jnl of buffer 
RW1 was pipeted onto the Rneasy column and centrifuged for 1 
minute at 10, OOOrpm. Buffer RPE was pipetted onto the Rneasy 
column in a new collection tube and centrifuged for 1 minute at 

25 10, 000 rpm. Buffer RPE was again, added to the Rneasy spin 
column and centrifuged for 2 minutes at maximum speed to dry 
the membrane. To eliminate any ethanol carry over, the 
memebrane was placed in a separate collection tube and 
centrifuged for an additional 1 minute at maximum speed. The 

30 Rneasy column was transfered into a new 1 . 5 ml collection tube, 
and 40 \xl of Rnase-free water was pipetted directly onto the 
Rneasy membrane. This final elute tube was centrifuged for 1 
minute at 10, OOOrpm. Quality and quantity of total RNA was 
analyzed by denatured formaldehyde gel and spectrophotometer. 

25 
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Poly(A)RNA was isolated using Oligotex poly A RNA 
purification kit (Qiagen Inc.) following manufacture's 
protocol. About 200 jag total RNA in 250 \xl maximum volume was 
5 used. A volume of 250p.l of Buffer OBB and 15 ]xl of Oligotex 
suspension was added to the 250 p.1 of total RNA. The contents 
were mixed thoroughly by pipetting and incubated for 3 minutes 
at 7 0°C on a heating block. The sample was then, placed at 
room temperature for approximately 2 0 minutes. The 

10 oligotex smRNA complex was pelleted by centrif ugation for 2 
minutes at maximum speed. All but 50 ]il of the supernatant was 
removed from the microcentrifuge tube. The Sample was treated 
further by OBB buffer. The oligotex :mRNA pellet was resuspended 
in 400 ]il of Buffer OW2 by vortexing. This mix was transferred 

15 onto a small spin column placed in a new tube and centrifuged 
for 1 minute at maximum speed. The spin column was transferred 
to a new tube and an additional 400 p.1 of Buffer OW2 was added 
to the column. The tube was then centrifuged for 1 minute at 
maximum speed. The spin column was transferred to a final 

20 1.5ml microcentrifuge tube. The sample was eluted with 60 ul 
of hot (70 C) Buffer OEB. Poly A product was analyzed by 
denatured formaldehyde gels and spectrophotometry analysis. 

25 EXAMPLE 3: REVERSE TRANSCRIPT10N-PCR 

First strand cDNA was produced using Superscript reverse 
transcriptase following manufacturer's protocol (Invitrogen, Carlsbad, 
California) . The poly A enriched RNA/oligo dT primer mix consisted of 

30 less than 5 p.g of total RNA, 1 ul of lOmM dNTP mix, 1 jil of Oligo 
d(T)i2-i8 (0.5vtg/]al) , and up to 10 pi of DEPC-treated water. Each 
sample was incubated at 65° C for 5 minutes, then placed on ice for at 
least 1 minute. A reaction mixture was prepared by adding each of the 
following components in order: 2 ]il 10X RT buffer, 4 ]il of 25 mM 

35 MgC12, 2]il of 0.1 M DTT, and' 1 ]il of RNase OUT Recombinant RNase 
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Inhibitor. An addition of 9 ]il of reaction mixture was pipetted to 
each RNA/primer mixture and gently mixed* It was incubated at 42° C 
for 2 minutes and 1 yil of Super Script II RT was added to each tube. 
The tube was incubated for 50 minutes at 42° C. The reaction was 
5 terminated at 70° C for 15 minutes and chilled on ice. The sample was 
collected by centrifugation and 1 p.1 of RNase H was added to each tube 
and incubated for 20 minutes at 37° C. The second PGR was carried out 
with 200 pmoles of forward primer (degenerate primers as in Figure 75, 
SEQ.ID Nos, 149-156) and 100 pmoles reverse primer (mix of 18nt oligo 
10 d(T) followed by 1 random base) * 



Reaction conditions were 94 °C for 2 minutes and then performed 4 0 
cycles of PGR at 94°C for 1 minute, 45° to 60°C for 2 minutes, 72°C 
for 3 minutes with a 72 °C extension for an extra 10 min. 

15 

Ten microliters of the amplified sample were analyzed by 
electrophoresis using a 1% agarose gel. The correct size fragments 
were purified from agarose gel. 

20 

EXAMPLE 4: GENERATION OF PGR FRAGMENT POPULATIONS 



PGR fragments from Example 3 were ligated into a pGEM-T Easy 
Vector (Pr omega, Madison, Wisconsin) following manufacturer's 

25 instructions. The ligated product was transformed into JM109 
competent cells and plated on LB media plates for blue/white 
selection. Colonies were selected and grown in a 96 well plate with 
1.2 ml of LB media overnight at 37°C. Frozen stock was generated for 
all selected colonies. Plasmid DNA from plates were purified using 

30 Beckman's Biomeck 2000 miniprep robotics with Wizard SV Miniprep kit 
(Promega) . Plasmid DNA was eluted with 100|alwater and stored in a 96 
well plate. Plasmids were digested by EcoRl and were analyzed using 1% 
agarose gel to confirm the DNA quantity and size of inserts. The 
plasmids containing a 400-600 bp insert were sequenced using an CEQ 

35 2000 sequencer (Beckman, Fullerton, California) * The sequences were 
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aligned with GenBank database by BLAST search* The P-450 related 
fragments were identified and further analyzed* 

5 EXAMPLE 5: CONSTRUCTION OF CDNA LIBRARY 

A cDNA library was constructed by preparing total RNA from 
ethylene treated leaves as follows* First, total RNA was extracted 
from ethylene treated leaves of tobacco line 58-33 using a modified 

10 acid phenol and chloroform extraction protocol* Protocol was modified 
to use one gram of tissue that was ground and subsequently vortexed in 
5 ml of extraction buffer (100 mM Tris-HCl, pH 8*5; 200 mM NaCl; 
lOmM EDTA; 0.5% SDS) to which 5 ml phenol (pH5*5) and 5 ml chloroform 
was added. The extracted sample was centrifuged and the supernatant 

15 was saved* This extraction step was repeated 2-3 more times until the 
supernatant appeared clear. Approximately 5 ml of chloroform was 
added to remove trace amounts of phenol. RNA was precipitated from 
the combined supernatant fractions by adding a 3-fold volume of ETOH 
and 1/10 volume of 3M NaOAc (pH5.2) and storing at -20° C for 1 hour* 

20 After transferring to Corex glass container it was centrifuged at 
9,000 RPM for 45 minutes at 4° C. The pellet was washed with 70% 
ethanol and spun for 5 minutes at 9,000 RPM at 4° C. After drying the 
pellet, the pelleted RNA was dissolved in 0.5 ml RNase free water. 
The pelleted RNA was dissolved in 0.5 ml RNase free water. The 

25 quality and quantity of total RNA was analyzed by denatured 
formaldehyde gel and spectrophotometer, respectively* 

The resultant total RNA was isolated for poly A+ RNA using an 
Oligo(dT) cellulose protocol (Invitrogen) and Microcentrifuge spin 

30 columns (Invitrogen) by the following protocol. Approximately twenty 
mg of total RNA was subjected to twice purification to obtain high 
quality poly A+ RNA* Poly A+ RNA product was analyzed by performing 
denatured formaldehyde gel and subsequent RT-PCR of known full-length 
genes to ensure high quality of mRNA. In addition, Northern analysis 

35 was performed on the poly A+RNA from ethylene treated non-converter 
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leaves, zero hour ethylene treated converter leaves and ethylene 
treated converter leaves using the full-length p450 as probe. The 
method was based on the protocol provided by the manufacturer's 
instructions (KPL RNADetector Northern Blotting Kit, Gaithersburg, 
5 Maryland) using 1.8 \xg of polyA4RNA for each sample. RNA containing 
gels were transferred overnight using 20X SSC as a transfer buffer. 



Next, poly A+ RNA was used as template to produce a cDNA library 

employing cDNA synthesis kit, ZAP-cDNA synthesis kit, and ZAP-cDNA 
10 Gigapack III gold cloning kit (Stratagene, La Jolla, California) . The 

method involved following the manufacture's protocol as specified. 

Approximately 8 jug of poly A+ RNA was used to construct cDNA library. 

Analysis of the primary library revealed about 2,5 x 10 6 - Ix 10 7 pfu. 

A quality background test of the library was completed by 
15 complementation assays using IPTG and'X-gal, where recombinant plaques 

was expressed at more than 100-fold above the background reaction. 



A more quantitative analysis of the library by random PCR showed 
that average size of insert cDNA was approximately 1.2 kb. The method 

20 used a two-step PCR method as followed. For the first step, reverse 
primers were designed based on the preliminary sequence information 
obtained from P450 fragments. The designed reverse primers and T3 
(forward) primers were used amplify corresponding genes from the cDNA 
library. PCR reactions were subjected to agarose electrophoresis and 

25 the corresponding bands of high molecular weight were excised, 

purified, cloned and sequenced. In the second step, new primers 
designed from 5'UTR or the start coding region of P450 as the forward 
primers together with the reverse primers (designed from 3'UTR of 
P450) were used in the subsequent PCR to obtain full-length P450 

30 clones. 



The P450 fragments were generated by PCR amplification from the 
constructed cDNA library as described in example 3 with the exception 
of the reverse primer. The T7 primer located on the plasmid downstream 
35 of cDNA inserts (see Figure 75), was used as a reverse primer. PCR 
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fragments were isolated, cloned and sequenced as described in Example 
4. 
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Numerous modifications and variations in practice of the 
invention are expected to occur to those skilled in the art upon 
consideration of the foregoing detailed description of the invention. 
Consequently, such modifications and variations are intended to be 
5 included within the scope of the following claims. 

EXAMPLE 6; CHARACTERIZATION OF CLONED FRAGMENTS - REVERSE SOUTHERN 
BLOTTING ANALYSIS 



10 Nonradioactive large scale reverse southern blotting assay was 

performed on all P450 clones identified in above examples to detect 
the differential expression. It was observed that the level of 
expression among different P450 clusters was very different. Further 
real time detection was conducted on those with high expression. 

15 

Nonradioactive southern blotting procedures were conducted as 
follows . 

1) Total RNA was extracted from ethylene treated converter (58- 
33) and nonconverter (58-25) leaves using the Qiagen Rnaeasy kit as 
20 described in Example 2. 



2) Probe was produced by biotin-tail labeling a single strand 
cDNA derived from poly A enriched RNA generated in above step. This 
labeled single strand cDNA was generated by RT-PCR of the converter 
25 and nonconverter total RNA (Invitrogen) as described in example 3 with 
the exception of using biotinalyted oligo dT as a primer (Promega) ; 
These were used as a probe to hybridize with cloned DNA. 



3) Plasmid DNA was digested with restriction enzyme EcoRl and 
30 run on agarose gels. Gels were simultaneously dried and transferred 
to two nylon membranes (Biodyne B) . One membrane was hybridized with 
converter probe and the other with nonconverter probe. Membranes were 
UV-crosslinked (auto crosslink setting, 254 nm, Stratagene, 
Stratalinker ) before hybridization. 

35 
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Alternatively, the inserts were PGR amplified from each plasmid 
using the sequences located on both arms of p-GEM plasmid, T3 and SP6, 
as primers. The PCR products were analyzed by running on a 96 well 
Ready-to-run agarose gels. The confirmed inserts were dotted on two 
nylon membranes. One membrane was hybridized with converter probe and 
the other with nonconverter probe . 

4) . The membranes were hybridized and washed following 
manufacture's instruction with the modification of washing stringency 

(Enzo Diagnostics, Inc, Farmingdale, NY) . The membranes were 
prehybridized with hybridization buffer (2x SSC buffered formamide, 
containing detergent and hybridization enhancers) at 42°C for 3 0 min 
and hybridized with 1 Ojul denatured probe overnight at 42°C. The 
membranes then were washed in IX hybridization wash buffer 1 time at 
room temperature for 10 min and 4 times at 68°C for 15 min. The 
membranes were ready for the detection. 

5) The washed membranes were detected by alkaline phosphatase 
labeling followed by NBT/BCIP colometric detection as described in 
manufacture's detection procedure (Enzo Diagnostics, Inc.). The 
membranes were blocked for one hour at room temperature with lx 
blocking solution, washed 3 times with IX detection reagents for 10 
min, washed 2 times with lx predevelopment reaction buffer for 5 min 
and then developed the blots in developing solution for 30-45 min 
until the dots appear. All reagents were provided by manufacture (Enzo 
Diagnostics, Inc) . 

In some cases, one step RT-PCR (Gibco Kit, Carlsbad, California) 
was performed on the total RNA' s from non-converter (58-25) and 
converter (58-33) lines using primers specific to the P-450 fragments. 
Comparative RT-PCR was conducted as follows: 

1) Total RNA from ethylene treated converter (58-33) and 

nonconverter (58-25) plant leaves was extracted as described 
in example 2. 
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2) Poly (A) RNA from total RNA was extracted using Qiagen kit as 
described in example 2. 

3) One step RT-PCR was conducted using primers specific to 
cloned P450 following the manufactures procedure (Invitogen) . The poly 

5 A enriched RNA was added to the reaction mix, along with, 25jlx1 of 2X 

Reaction Mix, 1 jutl of IOjliM Sense Primer, ljutl of 10 jliM Anti-sense Primer, 
1 jllI of RT/ Platinum taq Mix, and up to 50 of water. Reaction 
conditions were 50 °C for 20 minutes and then 94 C for 2 min, performed 
40 cycles of PGR at 94°C for 30 sec, 55° to for 30 sec, 70°C for 1 
10 minute with a 72 °C extension for an extra 10 min* Ten microliters of 
the amplified sample were analyzed by electrophoresis using a 1% 
agarose gel. 

15 EXAMPLE 7: CHARACTERIZATION OF CLONED FRAGMENTS - NORTHERN BLOT 
ANALYSIS 

Alternative to Southern Blot analysis, some membranes were 
hybridized and detected as described in the example of northern 
20 blotting assays. Northern Hybridization was used to detect mRNA 
differentially expressed in Nicotiana as follows. 

First step, probe preparation: the random priming method was 
used to prepare probes from cloned p450 DNA fragments (Random Primer 

25 DNA Biotinylation Kit, KPL) . The following components were mixed: 
O.Sjag DNA template (boiled in a water bath for 5-10 minutes and 
chilled on ice before use) ; IX Random Primer Solution; IX dNTP mix; 
10 units of Klenow and water was added to bring the reaction to 50^1. 
The mixture was incubated in 37 °C for 1-4 hours. The reaction was 

30 stopped with 2pl of 200 mM EDTA. The probe was denatured by incubating 
at 95 °C for 5 minutes before use. 

Second step, sample preparation: The RNA samples were prepared 
from ethylene treated and non-treated fresh leaves, and senescence 
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leaves. In some cases poly A enriched RNA was used. Approximately 
15|xg total RNA or 1.8ug mRNA (Methods of RNA and mRNA extraction are 
described in Example 5) was brought to egual volume with DEPC H20{5-10 
ul) . The same volume loading buffer (1 x MOPS; 18.5 % Formaldehyde; 50 
% Formamide; 4 % Ficoll400; Bromophenolblue) and 0.5 ul EtBr (0.5ug 
/ul) were added. The samples were heated at 90 °C for 5 minutes, and 
chilled on ice. 

Third step, separation of RNA by electrophoresis: Samples were 
subjected to electrophoresis on a formaldehyde gel (1 % Agarose, 1 x 
MOPS, 0.6 M Formaldehyde) with 1XMOP buffer (0.4 M 

Morpholinopropanesulfonic acid; 0.1 M Na-acetate-3 x H20; 10 mM EDTA; 
adjust to pH 7.2 with NaOH) . RNAs were transferred to Hybond-N+ 
membrane (Nylon, Amersham Pharmacia Biotech) by capillary method in 10 
X SSC buffer (1.5 M NaCl; 0.15 M Na-citrate ) for 24 hours. Membranes 
with RNA samples were UV- cross linked (auto crosslink setting, 254 nm, 
Stratagene, Stratalinker ) before hybridization. 
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Fourth step, hybridization: The membrane was prehybridized for 1- 
4 hours at 42 °C with 5-10 ml prehybridization buffer (5 x SSC; 50 % 
Formamide; 5 x Denhardt ' s-solution; 1 % SDS; lOO^g/ml heat-denatured 
sheared non- homologous DNA) . Old prehybridization buffer was 
discarded, and new prehybridization buffer and probe were added. The 
hybridization was carried out over night at 42 °C. The membrane was 
washed for 15 minutes with 2 x SSC at room temperature, followed by a 
wash with 2 x SSC, 0.1 % SDS at 65 °C for 2 times, and a final wash 
with 0.1 x SSC, or more wash with 0.1 x SDS at 65 °C (optional) . 

Fifth step, detection: AP-Streptavidin and CDP-Star were used to 
detect the hybridization signal ( KPL' s DNA Detector Northern blotting 
Kit) . The membrane was blocked with IX Detector Block Solution for 30 
minutes at room temperature. The blocking buffer was discarded and the 
membrane was incubated in new IX detector Block Solution with 1:10,000 
AP-SA at room temperature for 1 hour. The membrane was washed in IX 
Phosphatase Wash Solution for 3 times, followed by a wash with IX 
Phosphatase Assay Buffer for two times. The signal was detected with 
CDP-Star Chemi luminescent Substrate. The wet membrane was exposed to 
X-Ray film under saran™ wrap. The results were analyzed and recorded. 

A major focus of the invention was the discovery of novel genes 
that may be induced as a result of ethylene treatment or play a key 
role in tobacco leaf quality and constituents. As shown in the table 
below, Northern blots were useful in determining which genes were 
induced by ethylene treatment relative to non- induced plants. 
Interestingly, not all fragments were affected similarly in the 
converter and nonconverter . The cytochrome P450 fragments of interest 
were partially sequenced to determine their structural relatedness. 
This information was used to subsequently isolate and sequence full 
length gene clones. Functional analysis utilizing down- regulation 
methods was performed in whole plants with the fragments genes. 
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Fragments 


Induced mRNA Expression 
Ethylene Treatment 


Converter 


Nonconverter 


D18 6-AH4 






D5 6-AC7 




+ 


D56-AG11 


+ 




D56-AC12 


+ 




D7 0A-AB5 


+ 


+ 


D73-AC9 


+ 




D7 0A-AA12 


+ 


+ 


D73A-AG3 






D73A-AE10 






D35-AG11 


+ 




D58-AD4 


+ 


• + 


D34-52 




+ 


D5 6-AG6 




+ 



5 EXAMPLE 8: NUCLEIC ACID IDENTITY AND STRUCTURE RE LATE DNE S S OF 
ISOLATED NUCLEIC ACID FRAGMENTS' 

Over 100 cloned P450 fragments were sequenced in conjunction with 
Northern blot analysis to determine their structural relatedness. The 

10 approach used utilized forward primers based either of two common P450 
motifs located near the carboxyl-terminus of the P450 genes. The 
forward primers corresponded to cytochrome P450 motifs FXPERF or 
GRRXCP(A/G) as denoted in Figure 1. The reverse primers used standard 
primers from either the plasmid, SP6 or T7 located on both arms of 

15 pGEM plasmid, or a poly A tail. The protocol used is described below. 

Spectrophotometry was used to estimate the concentration of 
starting double stranded DNA following the manufacturer's protocol 
(Beckman Coulter) . The template was diluted with water to the 
20 appropriate concentration, denatured by heating at 95° C for 2 

minutes, and subsequently placed on ice. The sequencing reaction was 
prepared on ice using 0.5 to lOyil of denatured DNA template, 2 p.1 of 
1.6 pmole of the forward primer, 8 p.1 of DTCS Quick Start Master Mix 
and the total volume brought to 20 pi with water. The thermocycling 
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program consisted of 30 cycles of the follow cycle: 96° C for 20 
seconds, 50° C for 20 seconds, and 60° C for 4 minutes followed by 
holding at 4° C. 

5 The sequence was stopped by adding 5 ]il of stop buffer (equal 

volume of 3M NaOAc and lOOraM EDTA and 1 p.1 of 20 mg/ml glycogen) . The 
sample was precipitated with 60 ]il of cold 95% ethanol and centrifuged 
at 6000g for 6 minutes. Ethanol was discarded* The pellet was 2 
washes with 200 pi of cold 70% ethanol. After the pellet was dry, 40 
10 yxl of SLS solution was added and the pellet was resuspended. A layer 
of mineral oil was over laid* The sample was then, placed on the CEQ 
8 000 Automated Sequencer for further analysis. 

In order to verify nucleic acid sequences, nucleic acid sequence 
15 was re-sequenced in both directions using forward primers to the 

FXPERF or GRRXCP(A/G) region of the P450 gene or reverse primers to 
either the plasmid or poly A tail. All sequencing was performed at 
least twice in both directions. 

20 The nucleic acid sequences of cytochrome P450 fragments were 

compared to each other from the coding region corresponding to the 
first nucleic acid after the region encoding the GRRXCP (A/G) motif 
through to the stop codon. This region was selected as an indicator 
of genetic diversity among P450 proteins. A large number of 

25 genetically distinct P450 genes, in excess of 70 genes, was observed 
similar to that of other plant species. Upon comparison of nucleic 
acid sequences, it was found that the genes could be placed into 
distinct sequences groups based on their sequence identity. It was 
found that the best unique grouping of P450 members was determined to 

30 be those sequences with 75% nucleic acid identity or greater (shown in 
Table I) . Reducing the percentage identity resulted in significantly 
larger groups. A preferred grouping was observed for those sequences 
with 81% nucleic acid identity or greater, a more preferred grouping 
91% nucleic acid identity or greater, and a most preferred grouping 

35 for those sequences 99% nucleic acid identity of greater. Most of the 
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groups contained at least two members and frequently three or more 
members. Others were not repeatedly discovered suggesting that 
approach taken was able to isolated both low and high expressing mRNA 
in the tissue used. 

Based on 75% nucleic acid identity or greater, two cytochrome 
P450 groups were found to contain nucleic acid sequence identity to 
previously tobacco cytochrome genes that genetically distinct from 
that within the group. Group 23, showed nucleic acid identity, within 
the parameters used for Table I, to prior GenBank sequences of 
GI: 1171579 (CAA64635) and 61: 14423327 (or AAK62346) by Czernic et al 
and Ralston et al, respectively. GI: 1171579 had nucleic acid identity 
to Group 23 members ranging 96.9% to 99.5% identity to members of 
Group 23 while GI: 14423327 ranged 95.4% to 96.9% identity to this 
group. The members of Group 31 had nucleic acid identity ranging from 
7 6.7% to 97.8% identity to the GenBank reported sequence of 
GI: 14423319 (AAK62342) by Ralston et al . None of the other P450 
identity groups of Table 1 contained parameter identity, as used in 
Table 1, to Nicotiana P450s genes reported by Ralston et al, Czernic 
et al., Wang et al or LaRosa and Smigocki. 

As shown in Figure 7 6, consensus sequence with appropriate 
nucleic acid degenerate probes could be derived for group to 
preferentially identify and isolate additional members of each group 
from Nicotiana plants. 
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Table I: Nicotiana P450 Nucleic Acid Sequence Identity 
Groups 

GROUP FRAGMENTS 

1 D58-BG7 (SEQ ID No.:l) ; D58-AB1 (SEQ ID No.: 3); D58-BE4 
(SEQ ID No. : 7) 

2 D56-AH7 (SEQ ID No.: 9); D13a-5 (SEQ ID No.: 11) 

3 D56-AG10 (SEQ ID No.: 13); D35-33 (SEQ ID No.: 15); D34- 
62 ) (SEQ ID No. :17) 

4 D56-AA7 (SEQ ID No.: 19); D56-AE1 (SEQ ID No.:21); 185- 
BD3 (SEQ ID No. :143) 

5 D35-BB7 (SEQ ID No.:23); D177-BA7 (SEQ ID No.:25) D56A- 
AB6 (SEQ ID No.:27); D144-AE2 (SEQ ID No.:29) 

6 D56-AG11 (SEQ ID No.: 31); D179-AA1 (SEQ ID No.: 33) 

7 D56-AC7 (SEQ ID No.: 35); D144-AD1 (SEQ ID No.: 37) 

8 D144-AB5 (SEQ ID No.:39) 

9 D181-AB5 (SEQ ID No.: 41); D73-Ac9 (SEQ ID No.: 43) 

10 D56-AC12 (SEQ ID No.:45) 

11 D58-AB9 (SEQ ID No.:47); D56-AG9 (SEQ ID No. :49); D56- 
AG6 (SEQ ID No.:51); D35-BG11 (SEQ ID No.:53); D35-42 (SEQ 
ID No.:55); D35-BA3 (SEQ ID No.:57); D34-57 (SEQ ID 

No.: 59); D34-52 (SEQ ID No.: 61); D34-25 (SEQ ID No.: 63) 

12 D56-AD10 (SEQ ID No.: 65) 

13 56-AA11 (SEQ ID No.: 67) 

14 D177-BD5 (SEQ ID No.: 69); D177-BD7 (SEQ ID No.: 83) 

15 D56A-AG10 (SEQ ID No.:71); D58-BC5 (SEQ ID No.:73); 
D58-AD12 (SEQ ID No.:75) 

16 D56-AC11 (SEQ ID No.: 77); D35-39 (SEQ ID No.: 79); D58- 
BH4 (SEQ ID No.: 81); D56-AD6 (SEQ ID No.: 87) 
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17 D73A-AD6 (SEQ ID No.:89); D70A-BA11 (SEQ ID No.:91); 
D70A-BB5 (SEQ ID No.: 93) 

18 D70A-AB5 (SEQ ID No.: 95); D70A-AA8 (SEQ ID No.: 97) 

19 D7 0A-AB8 (SEQ ID No.: 99); D70A-BH2 (SEQ ID No. : 101); 
5 D70A-AA4 (SEQ ID No.:103) 

20 D70A-BA1 (SEQ ID No.:105); D70A-BA9 (SEQ ID No.:107); 
D176-BG2 (SEQ ID No.: 141) 

21 D70A-BD4 (SEQ ID No.: 109) 

22 D181-AC5 (SEQ ID No. rill); D144-AH1 (SEQ ID No.: 113); 
10 D34-65 (SEQ ID No.:115) 

23 D35-BG2 (SEQ ID No.:117) 

24 D73A-AH7 (SEQ ID No.: 119) 

25 D58-AA1 (SEQ ID No.: 121); D185-BC1 (SEQ ID No. : 133); 
D185-BG2 (SEQ ID No.:135) 

15 26 D73-AE10 (SEQ ID No.: 123) 

27 D56-AC12 (SEQ ID No.:125) 

28 D177-BF7 (SEQ ID No . : 127 ) ; D185-BE1 (SEQ ID No . : 137 ) ; 
185-BD2 (SEQ ID No . : 139) 

29 D73A-AG3 (SEQ IDNo.:129) 

20 30 D70A-AA12 (SEQ ID No.:131); D176-BF2 (SEQ ID No.:85) 

31 D176-BC3 (SEQ ID No.:145) 

32 D176-BB3 (SEQ ID No.: 147) 

33 D186-AH4 (SEQ ID No.:5) 

25 EXAMPLE 9: RELATED AMINO ACID SEQUENCE IDENTITY OF ISOLATED 

NUCLEIC ACID FRAGMENTS 

The amino acid sequences of nucleic acid sequences obtained for 
cytochrome P450 fragments from Example 8 were deduced. The deduced 
30 region corresponded to the amino acid immediately after the 
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GXRXCP(A/G) sequence motif to the end of the carboxyl- terminus, or 
stop codon. Upon comparison of sequence identity of the fragments, a 
unique grouping was observed for those sequences with 70% amino acid 
identity or greater. A preferred grouping was observed for those 
sequences with 80% amino acid identity or greater, more preferred with 
90% amino acid identity or greater, and a most preferred grouping for 
those sequences 99% amino acid identity of greater. The groups and 
corresponding amino acid sequences of group members are shown in 
Figure 2. Several of the unique nucleic acid sequences were found to 
have complete amino acid identity to other fragments and therefore 
only one member with the identical amino acid was reported. 

The amino acid identity for Group 19 of Table II corresponded to 
three distinct groups based on their nucleic acid sequences. The 
amino acid sequences of each group member and their identity is shown 
in Figure. 77. The amino acid differences are appropriated marked. 

At least one member of each amino acid identity group was 
selected for gene cloning and functional studies using plants. In 
addition, group members that are differentially affected by ethylene 
treatment or other biological differences as assessed by Northern and 
Southern analysis were selected for gene cloning and functional 
studies. To assist in gene cloning, expression studies and whole 
plant evaluations, peptide specific antibodies will be prepared on 
sequence identity and differential sequence. 

Table II: Nicotiana P450 Amino Acid Sequence Identity Groups 

GROUP FRAGMENTS 

1 D58-BG7 (SEQ ID No.: 2), D58-AB1 (SEQ ID No.: 4) 

2 D58-BE4 (SEQ ID No.: 8) 

3 D56-AH7 (SEQ ID No.: 10); D13a-5 (SEQ ID No.: 12) 
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4 D56-AG10 (SEQ ID No.:14); D34- 
(SEQ ID No. : 18) 

5 D56-AA7 (SEQ ID No.: 20); D56-AE1 (SEQ ID No.: 22); 185- 
BD3 (SEQ ID No. :144) 

6 D35-BB7 (SEQ ID No.: 24); D177-BA7 (SEQ ID No.: 26); 
D56A-AB6 (SEQ ID No.: 28); D144-AE2 (SEQ ID No.: 30) 

7 D56-AG11 (SEQ ID No.:32); D179-AA1 (SEQ ID No.:34) 

8 D56-AC7 (SEQ ID No.: 36); D144-AD1 (SEQ ID No.: 38) 

9 D144-AB5 (SEQ ID No.:40) 

10 D181-AB5 (SEQ ID No.: 42); D73-Ac9 (SEQ ID No.: 44) 

11 D56-AC12 (SEQ ID No.: 46) 

12 D58-AB9 (SEQ ID No.:48); D56-AG9 (SEQ ID No.:50); D56- 
AG6 (SEQ ID No.:52); D35-BG11 (SEQ ID No.:54); D35-42 (SEQ 
ID No.: 56); D35-BA3 (SEQ ID No.: 58); D34-57 (SEQ ID 

No.: 60); D34-52 (SEQ ID No.: 62) 

13 D56AD10 (SEQ ID No.: 66) 

14 56-AA11 (SEQ ID No.: 68) 

15 D177-BD5 (SEQ ID No.: 70); D177-BD7 (SEQ ID No.: 84) 

16 D56A-AG10 (SEQ ID No.: 72); D58-BC5 (SEQ ID No.: 74); 
D58-AD12 (SEQ ID No.: 76) 

17 D56-AC11 (SEQ ID No.:78); D56-AD6 (SEQ ID No.:88) 

18 D73A-AD6 (SEQ ID No.90:); D70A-BB5 (SEQ ID No.:94) 

19 D70A-AB5 (SEQ ID No.:96); D70A-AB8 (SEQ ID No.:100); 
D70A-BH2 (SEQ ID No.:102); D70A-AA4 (SEQ ID No.:104); D70A- 
BA1 (SEQ ID No.:106); D70A-BA9 (SEQ ID No . : 108) ; D176-BG2 
(SEQ ID No. :142) 

20 D70A-BD4 (SEQ ID No.: 110) 

21 D181-AC5 (SEQ ID No.: 112); D144-AH1 (SEQ ID No.: 114); 
D34-65 (SEQ IDNo.:116) 

22 D35-BG2 (SEQ ID No.: 118) 
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23 D73A-AH7 (SEQ ID No.:120) 

24 D58-AA1 (SEQ ID No.: 122); D185-BC1 (SEQ ID No.: 134) 
D185-BG2 (SEQ ID No.: 136) 

25 D73-AE10 (SEQ ID No.:124) 

26 D56-AC12 (SEQ ID No.:126) 

27 D177-BF7 (SEQ ID No.: 128); 185-BD2 (SEQ ID No.: 140) 

28 D73A-AG3 (SEQ ID No.: 130) 

29 D70A-AA12 (SEQ ID No.:132); D176-BF2 (SEQ 
ID No. : 86) 

30 D176-BC3 (SEQ ID No.: 146) 

31 D176-BB3 (SEQ ID No .: 148) 

32 D186-AH4 (SEQ ID No.: 6) 



EXAMPLE 10: CLONING OF FULL LENGTH cDNA P450 CLONES 

A cDNA library was constructed by preparing total RNA from 
ethylene treated leaves as follows. First, total RNA was 
extracted from ethylene treated leaves using a' modified acid 
phenol and chloroform extraction protocol. Protocol was 
modified to use one gram of tissue that was ground and 
subsequently vortexed in 5 ml of extraction buffer (100 mM 
Tris-HCl, pH 8.5; 200 mM NaCl; lOmM EDTA; 0.5% SDS) to which 
5 ml phenol (pH5.5) and 5 ml chloroform was added. The 
extracted sample was centrifuged and the supernatant was saved. 
This extraction step was repeated 2-3 more times until the 
supernatant appeared clear. Approximately 5 ml of chloroform 
was added to remove trace amounts of phenol. RNA was 
precipitated from the combined supernatant fractions by adding 
a 3-fold volume of ETOH and 1/10 volume of 3M NaOAc (pH5.2) and 
storing at -20° C for 1 hour. After transfering to Corex glass 
container it was centrifuged at 9,000 RPM for 45 minutes at 4° 
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C. The pellet was ■ washed with 70% ethanol and spun for 5 
minutes at 9, 000 RPM at 4° C. After drying the pellet, the 
pelleted RNA was dissolved in 0.5 ml RNase free water. The 
pelleted RNA was dissolved in 0.5 ml RNase free water. The 
quality and quantity of total RNA was analyzed by denatured 
formaldehyde gel and spectrophotometer, respectively. 

The resultant total RNA was isolated for poly A+ RNA using 
an Oligo(dT) cellulose protocol (Invitrogen) and 
Microcentrifuge spin columns ( Invitrogene) by the following 
protocol. Approximately twenty mg of total RNA was subjected 
to twice purification to obtain high quality poly A+ RNA. Poly 
A+ RNA product was analyzed by performing denatured 
formaldehyde gel and subsequent RT-PCR of known full-length 
genes to ensure high quality of mRNA. In addition, Northern 
analysis was performed on the poly A+RNA from ethylene treated 
non-converter leaves, zero hour ethylene treated converter 
leaves and ethylene treated converter leaves using the full- 
length p450 as probe. The method was based on the protocol 
provided by the manufacturer's instructions (KPL RNADetector 
Northern Blotting Kit) using 1.8 ug of polyA+RNA for each 
sample. RNA containing gels were transferred overnight using 
20X SSC as a transfer buffer. 

Next, poly A+ RNA was used as template to produce a cDNA 
library employing cDNA synthesis kit, ZAP-cDNA synthesis kit, 
and ZAP-cDNA Gigapack III gold cloning kit (Stratagene) . The 
method involved following the manufacture's protocol as 
specified. Approximately 8 ug of poly A+ RNA was used to 
construct cDNA library. Analysis of the primary library 
revealed about 2.5 x 106 - lx 107 pfu. A quality background 
test of the library was completed by a- complementation using 
IPTG and X-gal, where recombinant plaques was expressed at more 
than 100-fold above the background reaction. 
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A more quantitative analysis of the library by random PCR 
showed that average size of insert cDNA was approximately 1.2 
kb. The method used a two-step PCR method as followed. For 
the first step, reverse primers were designed based on the 
preliminary sequence information obtained from p450 fragments. 
The designed reverse primers and T3 (forward) primers were used 
amplify corresponding genes from the cDNA library. PCR 
reactions were subjected to agarose electrophoresis and the 
corresponding bands of high molecular weight were excised, 
purified, cloned and sequenced. In the second step, new 
primers designed from 5'UTR or the start coding region of p450s 
as the forward primers together with the reverse primers 
(designed from 3'UTR of p450) were used in the subsequent PCR 
to obtain full-length p450 clones. 

Full-length p450 genes were isolated by PCR method from 
constructed cDNA library. Two steps of PCR were used to clone 
the full-length genes. In the first step PCR, unspecific 
reverse primer (T3) and specific forward primer (generated from 
the downstream sequence of P450s) were used to clone the 5 ' end 
of the P450s from cDNA library. PCR fragments were isolated, 
cloned and sequenced for designing the forward primers in next 
step PCR. Two specific primers were used to clone the full- 
length p450 clones in the second step PCR. The clones were 
subsequently sequenced. 

Numerous modifications and variations in practice of the 
invention are expected to occur to those skilled in the art 
upon consideration of the foregoing detailed description of the 
invention. Consequently, such modifications and variations are 
intended to be included within the scope of the following 
claims . 
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WHAT IS CLAIMED IS: 

1. An isolated nucleic acid molecule, wherein said 
nucleic acid molecule comprises a nucleic acid sequence 
selected from the group consisting of SEQ. ID- 1, 3, 5, 7, 
9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 
39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 
67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 
95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 
119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 
141, 143, 145, 147 or 149, 

2* The isolated nucleic acid molecule of claim 1, 
wherein said nucleic acid molecule comprises a fragment of 
a cytochrome P450 gene. 

3. An isolated nucleic acid molecule, wherein said 
nucleic acid molecule has at least 75% identity to the 
nucleic acid molecule of Claim 1. 

4. An isolated nucleic acid molecule, wherein said 
nucleic acid molecule has at least 91% identity to the 
nucleic acid molecule of Claim 1. 

5. An isolated nucleic acid molecule, wherein said 
nucleic acid molecule has at least 99% identity to the 
nucleic acid molecule of Claim 1. 
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6. A transgenic plant, wherein said transgenic plant 
comprises the nucleic acid molecule of claim 1, 2, 3, 4 or 
5. 

5 7. The transgenic plant of Claim 6, wherein said plant 
is a tobacco plant. 

8. A method of producing a transgenic plant, said method 
comprising the steps of: 
10 (i) operably linking the nucleic acid molecule of 

Claims 1, 2, 3, 4 or 5, with a promoter functional in said 
plant to create a plant transformation vector; and 

(ii) transforming said plant with said plant 
transformation vector of step (i) ; 

15 (iii) selecting a plant cell transformed with said 
transformation vector; and 

(iv) regenerating a plant from said plant cell of step 

(iii) . 

20 9. The method of Claim 8, wherein said nucleic acid 
molecule is in an antisense orientation. 

10. The method of Claim 8, wherein said nucleic acid 
molecule is in a sense orientation. 
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11. The method of Claim 8 wherein said nucleic acid is in 
a RNA interference orientation. 



12. The method of Claim 11, wherein said nucleic acid 

5 molecule is expressed as a double stranded RNA molecule. 

13. The method of Claim 11, wherein said double stranded 
RNA molecule is about 15 to 25 nucleotide in length. 

10 14. The method of Claim 8, wherein said plant is a 
tobacco plant. 

15. A method of selecting a plant containing a nucleic 
acid molecule, wherein said plant is analyzed for the 

15 presence of nucleic acid sequence, wherein said nucleic 
sequence acid selected from the group consisting of SEQ. 
ID 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 
31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 50, 51, 53, 55, 
57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 

20 85, 87, 89, 91, 92, 93, 95, 97, 99, 101, 103, 105, 107, 
109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 
131, 133, 135, 137, 139, 141, 143, 145 or 147. 

16. The method of selecting a plant of claim 15, wherein 
25 said plant is analyzed by DNA hybridization. 
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17. The method of selecting a plant of claim 15, wherein 
said plant is analyzed by PCR detection. 



18. The' method of claim 16, wherein said DNA 
hybridization comprises a nucleic acid probe, said nucleic 
acid probe is selected from a group consisting of SEQ. ID. 
1, 3, 5, 7, 9, 11, 13, 15, 17,19, 21, 23, 25, 27, 29, 31, 
33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 
61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 
89, 91 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 
115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 
137, 139, 141, 143, 145 or 147. 

19. The method of selecting a plant of claim 15, wherein 
said plant is a transgenic plant. 

20. The method of selecting a plant of claim 15, wherein 
said plant is selected from a mutagenesis population. 

21. The method of selecting a plant of claim 15, wherein 
said plant is selected from a breeding population. 
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FIG. 1 



SEQ ID 1 D58-BG7 

1 GCACAACTT GCTAT CAACT TGGTCACATC TATGTTGGGT 

61 CATTTGTTGC AT C ATT T T AC ATGGGCTCCG GCCCCGGGGG TTAACCCGGA G GAT AT T G AC 

121 TTGGAGGAGA GCCCTGGAAC AGTAACT T AC AT GAAAAAT C CAATACAAGC TAT T C CAACT 

181 CCAAGATTGC CTGCACACTT GTATGGACGT GTGCCAGTGG ATATGTAA 
SEQ ID 2 

AQLAINLVTSMLGHLLHHFTWAPAPGVNPEDIDLEESPGTVTYMKNPIQAIPTPRLPAHLYGRVPVDM 



FIG. 2 

SEQ ID 3 D58-AB1 

1 GCACAACT T GCTAT CAAC TTGGTCACAT CTATGTTGGG 

61 TCATTTGTTG CATCATTTTA CGTGGGCTCC GCCCCCGGGG GTTAACC CGG AGAATATTGA 

121 CTTGGAGGAG AGCCCTGGAA CAGTAACTTA CAT GAAAAAT CCAATACAAG CTATTCCTAC 

181 TCCAAGATTG CCTGCACACT TGTATGGACG TGTGCCAGTG GATATGTAA 
SEQ ID 4 

AQLAINLVTSMLGHLLHHFTWAPPPGVNPENIDLEESPGTVTYMKNPIQAIPTPRLPAHLYGRVPVDM 



FIG. 3 

SEQ ID 5 D186-AH4 

1 ATGAATTAT T CAT T GCAAG TGGAACACCT TTCAATTGCT 
61 CAT AT GAT C C AAGGTTTCAG TTTTGCAACT ACGACCAATG AGCCTTTGGA TAT GAAACAA 
121 GGTGTGGGTT TAACTTTACC AAAGAAGACT GATGTTGAAG TGCTAATTAC ACCTCGCCTT 
181 CCTCCTACGC TTTATCAATA TTAA 
SEQ ID 6 

MM Y S LQVE HL S I AHMI O G F S FAT T T N E P L DMKQ GVGLT L P KKT DVEVL I T P RL PPT L YQ Y 



FIG. 4 

SEQ ID 7 D58-BE4 

1 GCACAACTT G CT AT CAACT TGGTCACATC TATGTTGGGT 

61 CATTTGTTCA TCATTTTACA TGGGCTCCGG CCCCGGGGGT TAACCCGGAG GATATTGACT 

121 TGGAGGAGAG CCCTGGAACA GTAACTTACA TGA 
SEQ ID 8 

AQLAINLVTSMLGHLFIILHGLRPRGLTRRILTWRRALEQ 



FIG. 5 

SEQ ID 9 D56-AH7 

1 GAAGGATTG GCTGTTCGAA TGGTTGCCTT GT CAT T GGGA 

61 TGTATTATTC AATGTTTTGA TTGGCAACGA AT CGGCGAAG AATTGGTTGA TATGACTGAA 

121 GGAACTGGAC TTACTTTGCC TAAAGCTCAA CCTTTGGTGG CCAAGTGTAG CCCACGACCT 

181 AAAAT GGCTA ATCTTCTCTC TCAGATTTGA 



SEQ ID 10 

EGIA.VRMVALSLGCIIQCFDWQRIGEELVDMTEGTGLTLPKAQPLVAKCSPRPKMAMLLSQI 
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FIG. 6 

SEQ ID 11 Dl3a-5 

1 GAAGGATTG GCTATTCGAA TGGTTGCATT GTCATTGGGA 
61 T GT AT T ATT C AATGCTTTGA TTGGCAACGA CTTGGGGAAG GATTGGTTGA TAAGACTGAA 
121 GGAACT GGAC TTACTTTGCC TAAAGCTCAA CCTTTAGTGG CCAAGTGTAG CCCACGACCT 
181 ATAAT GGCTA ATCTTCTTTC TCAGATTTGA 
SEQ ID 12 

EGLAIRMVALSLGCI IQCFDWQRLGEGLVDKTEGTGLTLPKAQPLVAKCSPRPIMANLLSQI 



FIG. 7 

SEQ ID 13 D56-AG10 

1 ATAGGTTTT GCGACTTTAG T GACACAT CT GACTTTTGGT 

61 CGCTTGCTTC AAGGTTTTGA TTTTAGTAAG C CAT CAAACA CGCCAATTGA CAT GACAGAA 

121 GGCGTAGGCG TTACTTTGCC TAAGGTTAAT CAAGT T GAAG TTCTAATTAC CCCTCGTTTA 

181 CCTTCTAAGC TTTATTTATT TTGA 
SEQ ID 14 

IGFATLVTHLTFGRLLQGFDFSKPSNTPIDMTEGVGVTLPKVNQVEVLITPRLPSKLYLF 



FIG. 8 

SEQ ID 15 D35-33 

1 ATAGGCTTT GCGACTTTAG T GACACAT CT GACTTTTGGT 
61 CGCTTGCTTC AAGGTTTTGA TTTTAGTAAG C CAT CAAACA CGCCAATTGA CAT GACAGAA 
121 GGCGTAGGCG TTACTTTGCC TAAGGTTAAT CAAGT T GAAG TTCTAATTAC CCCTCGTTTA 
181 CCTTCTAAGC TTTATTTAT 
SEQ ID 16 

IGFATLVTHLTFGRLLQGFDFSKPSNTPIDMTEGVGVrLPKWIQVEVLITPRLPSKLYL 



FIG. 9 

SEQ ID 17 D34-62 

1 ATAAATTTT GCGACTTTAG T GACACAT CT GACTTTTGGT 

61 CGCTTGCTTC AAGGTTTTGA TTTTAGTACG C CAT CAAACA CGCCAATAGA CAT GACAGAA 

121 GGCGTAGGCG TTACTTTGCC TAAGGTAAAT CAAGT GGAAG TTCTAATTAG CCCTCGTTTA 

181 CCTTCTAAGC TTTATGTATT CTGA 
SEQ ID 18 

INFATLVTHLTFGRLLQGFDFSTPSNTPIDMTEGVGVTLPKVNQVEVLISPRLPSKLYVF 



FIG. 10 

SEQ ID 19 D56AA7 

1 AT TAT ACT T GCATTGCCAA TTCTTGGCAT CACTTTGGGA 

61 CGTTTGGTTC AGAACTTTGA GCTGTTGCCT CCTCCAGGCC AGTCGAAGCT CGACACCACA 

121 GAGAAAGGTG GACAGTTCAG T CT C C AC AT T T T GAAG CAT T CCACCATTGT GTTGAAACCA 

181 AGGTCTTTCT GA 
SEQ ID 20 

IILALPILGITLGRLVQNFELLPPPGQSKLDTTEKGGQFSLHILKHSTIVLKPRSF 
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FIG. 11 

SEQ ID 21 D56-AE1 

1 ATTATACTT GCATTGCCAA TTCTTGGCAT TACTTT GGGA 
61 CGTTTGGTTC AGAACTTTGA GCTGTTGCCT CCTCCAGGCC AGTCGAAGCT CGACACCACA 
121 GAGAAAGGTG GACAGTTCAG TCTCCATATT TTGAAGCATT CCACCATTGT GTTGAAACCA 
181 AGGTCTTGCT GA 
SEQ ID 22 

IILALPILGITLGRLVQNFELLPPPGQSKLDTTEKGGQFSLHILKHSTIVLKPRSC 



FIG. 12 

SEQ ID 23 D35-BB7 

1 TATT GCACTT GGGGTTGCAT CAAT GGAACT TGCATTGTCA . 
61 AATCTTCTTT AT GC AT T T GA TTGGGAGTTA CCTTTTGGAA TGAAAAAAGA AGACATTGAC 
121 ACAAACGCCA GGCCTGGAAT T AC CAT G CAT AAGAAAAACG AACTTTATCT TATCCCTAAA 
181 AATTATCTAT AG 
SEQ ID 24 

I ALGVASMELALS NLL YAFDWELP FGMKKED I DTNARPGI TMHKKNEL YLI P KN YLP S KL YL F 



FIG. 13 

SEQ ID 25 D177-BA7 

1 ATTGCACTTG GGGTTGCATC CAT G GAACT T 

121 GCTTTGTCAA ATCTTCTTTA TGCATTTGAT TGGGAGTTAC CTTACGGAGT GAAAAAAGAA 

181 AACATTGACA CAAATGTCAG GCCTGGAATT AC CAT GCATA AG AAAAAC G A ACTTTGCCTT 

241 ATCCCTAGAA AT T AT CT ATA G 
SEQ ID 2 6 

IALGVASMELALSNLLYAFDWELPYGVKKENIDTNVRPGITMHKKNELCLIPRMYL 



FIG . 14 

SEQ ID 27 D5 6A-AB6 

1 GGTATTGCAC TTGGGGTTGC AT C CAT G G AA CTTGCTTTGT CAAATCTTCT TTATGCATTT 
61 GATT GGGAGT TGCCTTATGG AGTGAAAAAA GAAGACATCG AC ACAAAC GT TAGGCCT GGA 
121 ATTGCCATGC ACAAGAAAAA CGAACTTTGC CTTGTCCCAA AAAATTATTT ATAA 
SEQ ID 28 

IALGVASMELALSNLLYAFDWELPYGVKKEDIDTNVRPGIAMHKKNELCLVPKNYL 



FIG . 15 

SEQ ID 29 D144-AE2 

1 ATT GCACTT GGGG TTGCATCCAT G GAACT TGCT 

61 TTGTCAAATC TTCTTTATGC ATT T GATT GG GAGTTGCCTT AT GGAGT GAA AAAAGAAGAC 

121 AT C G AC ACAA ACGTTAGGCC TGGAATTGCC AT G C AC AAGA AAAAC GAACT TTGCCTTGTC 

181 CCAAAAAAAT T AT TTAT AAA TTATATTGGG AC GT G GAT CT CAT GC TAG 
SEQ ID 30 

IALGVASMELALSNLLYAFDWELPYGVKKEDIDTNVRPGIAMHKKNELCLVPKKLFINYIGTWISC 
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FIG. 16 

SEQ ID 31 D56-AG11 

1 ATTTCGTTT GGTTTAGCTA ATGCTTATTT GCCATTGGCT 
61 CAATTACTTT ATCACTTTGA TTGGGAACTC CCCACTGGAA TCAAACCAAG CGACTTGGAC 
121 TTGACTGAGT TGGTTGGAGT AACTGCCGCT AGAAAAAGT G ACCTTTACTT GGTT GCGACT 
181 CCTTATCAAC CTCCTCAAAA CTGA 
SEQ ID 32 

ISFGLANAYLPLAQLLYHFDWELPTGIKPSDLDLTELVGVTAARKSDLYLVATPYQPPQN 



FIG. 17 

SEQ ID 33 D179-AA1 

1 ATTTCGTTT GGCTTAGCTA ATGCTTATTT GCCATTGGCT 
61 CAATTACTAT AT C ACT T C GA TTGGAAACTC CCTGCTGGAA TCGAACCAAG CGACTTGGAC 
121 TTGACTGAGT TGGTTGGAGT AACTGCCGCT AGAAAAAGT G ACCTTTACTT GGTT GCGACT 
181 CCTTATCAAC CTCCTCAAAA GTGA 
SEQ ID 34 

ISFGIiANAYLPLAQLLYHFDWKLPAGIEPSDLDLTELVGVTAARKSDLYLVATPYQPPQK 



FIG. 18 

SEQ ID 35 D5 6-AC7 

1 ATGCTATTT GGTTTAGCTA ATGTTGGACA ACCTTTAGCT 
61 CAGTTACTTT AT CACTT C GA TTGGAAACTC CCTAATGGAC AAAGT CAT GA GAATTTC GAC 
121 AT GACT GAGT CACCTGGAAT TTCTGCTACA AGAAAGGATG ATCTTGTTTT GATTGCCACT 
181 CCTTATGATT CTT AT TAATT C CAGT CT A TATCATCTAT AT GTACT CAA TAATTGTATG 
3 61 GGA. 
SEQ ID 36 

MLFGLANVGQ P LAQL L YH FDWKLPNGQ S H EN FDMTES P G I SAT RKD D LVL I AT P YD S Y 



FIG. 19 

SEQ ID 37 D144-AD1 

1 AT GC TATTTGGTTT AGCTAAT GTT 
61 GGACAACCTT TAGCTCAGTT ACT TTATC AC TTCGATTGGA AACTCCCTAA. TGGACAAACT 
121 CACCAAAATT T CGACAT GAC TGAGTCACCT GGAATTTCTG CTACAAGAAA GGATGATCTT 
181 ATTTTGATTG CCACTCCTGC TCATTCTTGA 
SEQ ID 38 

MLFGLANVGQPLAQLLYHFDWKLPNGQTHQNFDMTESPGISATRKDDLILIATPAHS 



FIG. 20 

SEQ ID 39 D144-AB5 

1 TTAT TATTCGGTTT AGTTAAT GTA 
61 GGACATCCTT TAGCTCAATT GCTTTATCAC TTCGATTGGA AGACTCTTCC TGGGATAAGT 
121 TCAGATAGTT T C GAC AT GAC T GAAAC AG AT GGAGTAACTG CCGGAAGAAA GGATGATCTT 
181 TGTTTAATTG CTACTCCTTT TGGTCTCAAT TAA 
SEQ ID 40 

LLFGLWVGHPLAQLLYHFDWKTLPGISSDSFDMTETDGVTAGRKDDLCLIATPFGLN- 
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SEQ ID 41 D181-AB5 

1 A TGTCGTTTGG TTTAGTTAAC ACTGGGCATC CTTTAGCTCA 
61 GTTGCTCTAT TTCTTTGACT GGAAATTCCC TCATAAGGTT AATGCAGCTG ATTTTCACAC 
121 TACT GAAAC A AGTAGAGTTT TTGCAGCAAG CAAAGATGAC CTCTACTTGA TTCCAACAAA 
181 TCACATGGAG CAAGAGTAG 
SEQ ID 42 

MS FGLVNT GH P LAQLLYFFDWKFPHKVNAAD FHTTET S RVFAAS KDDL YLI PTNHMEQE 



FIG. 22 

SEQ ID 43 D73-AC9 

1 AT GTCGTTTGGT TTAGTTAACA CAGGGCATCC TTTAGCCCAG 
121 TTGCTCTATT GCTTTGACTG GAAACTCCCT GACAAGGT T A AT GCAAAT GA TTTTCGCACT 
181 ACT GAAACAA GTAGAGTTTT TGCAGCAAGC AAAGATGACC T CT ACT T GAT TCCCACAAAT 
241 CACAGGGAGC AAGAATAG 
SEQ ID 44 

MSFGLVNTGHPLAQLLYCFDWKLPDKVNANDFRTTETSRVFAASKDDLYLIPTNHREQE 



FIG. 23 

SEQ ID 45 D56-AC12 

1 ATGCAATTT GGTTTGGCTC TTGTTACTCT GCCATTGGCT 
61 CATTTGCTTC ACAATTTTGA TTGGAAACTT CCCGAAGGAA TTAATGCAAG GGATTTGGAC 
121 AT GACAGAG G CAAATGGGAT AT CT GCTAGA AGAGAAAAAG ATCTTTACTT GATTGCTACT 
181 C C T TAT GT AT CACCTCTTGA TTAA 
SEQ ID 46 

MQFGLALVTLPLAHLLHNFDWKLPEGINARDLDMTEANGISARREKDLYLIATPYVSPLD 



FIG. 24 

SEQ ID 47 D58-AB9 

1 ATGACTTAT GCATTGCAAG T GGAACAC CT AACAATGGCA 

61 CATTTGATCC AGGGTTTCAA TTACAGAACT CCAACT GAT G AGCCCTTGGA TAT GAAAGAA 

121 GGT GCAGGCA T AACT AT AC G TAAGGTAAAT CCTGTGAAAG TGATAATTAC GCCTCGCTTG 

181 GCACCTGAGC TTTATTAA 
SEQ ID 48 

MTYALQVEHLTMAHLIQGFNYRTPTDEPLDMKEGAGITIRKVNPVKVIITPRLAPELY 



FIG. 25 

SEQ ID 49 D56-AG9 

1 ATGACTTAT GCATTGCAAG T GGAACAC CT AACAATGGCA 

61 CATTTAATCC AGGGTTTCAA TTACAAAACT CCAAAT GAC G AGGCCTTGGA TATGAAGGAA 

121 GGT GCAGGCA TAACT AT AC G TAAGGTAAAT CCTGTGGAAC TGATAATAGC GCCTCGCCTG 

181 GCACCTGAGC TTTATTAA 
SEQ ID 50 

MTYALQVEHLTMAHLIQGFNYKTPNDE^LDMKEGAGITIRKWPVELIIAPRLAPELY 
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FIG. 26 

SEQ ID 51 D56-AG6 

1 ATGACTTAT GCATT GCAAG TGGAACACCT AACAATGGCA 
61 CATTTAATCC AGGGTTTCAA TTACAAAACT CCAAATGACG AGGCCTTGGA TATGAAGGAA 
121 GGT GCAGGCA TAACAATACG TAAGGTAAAT CCAGTGGAAT TGATAATAAC GCCTCGCTTG 
181 GCACCTGAGC TTTACTAA 
SEQ ID 52 

MTYALQVEHLTMAHLIQGFNYKTPNDEALDMKEGAGITIRKVNPVELIITPRLAPELY 



FIG. 27 

SEQ ID 53 D35-BG11 

1 ATGACTTAT GC AT T GCAAG TGGAACACTT AACAATGGCA 
61 CAT TT GAT C C AAGGTTTCAA TTACAGAACT CCAAATGACG AGCCCTTGGA TATGAAGGAA 
121 GGT GCAGGCA TAACT AT AC G TAAGGTAAAT CCTGTGGAAC TGATAATAGC GCCTCGCCTG 
181 GCACCTGAGC TTTATTAA 
SEQ ID 54 

MTYALQVEHLTMAHLIQGFNYRTPNDEPLDMKEGAGITIRKVNPVELIIAPRLAPELY 



FIG. 28 

SEQ ID 55 D35-42 

1 ATGACTTAT GCATTGCAAG TGGAACACTT AACAATGGCA 

61 CATTTGATCC AAGGTTTCAA TTACAGAACT CCAAATGACG AGCCCTTGGA TATGAAGGAA 

121 GGTGCAGGCA TAACT AT AC G TAAGGTAAAT CCTGTGGAAC TGATAATAGC GCCCCTGGCA 

181 CCTGAGCTTT ATTAA 
SEQ ID 56 

MTYALQVEHLTMAHLIQGFNYRTPNDEPLDMKEGAGITIRKYNPVELIIAPLAPELY 



FIG. 29 

SEQ ID 57 D35-BA3 

1 ATGACTTAT GCATTGCAAG TGGAACACTT AACAATGGCA 
61 CATTTGATCC AAGGTTTCAA TTACAGAACT CCAAATGACG AGCCCTTGGA TATGAAGGAA 
121 GGTGCAGGCA TAACT AT AC G TAAGGTAAAT CCTGCGGAAC TGATAATAGC GCCTCGCCTG 
181 GCACCTGAGC TTTATTAA 
SEQ ID 58 

MTYALQVEHLTMAHLIQGFNYRTPNDEPLDMKEGAGITIRKVNPAELIIAPRLAPELY 



FIG. 30 

SEQ ID 59 D34-57 

1 ATGACTTAT GC AT TACAAG TGGAACACCT AAC AAT AG C A 
61 CATTTGATCC AGGGTTTCAA TTACAAAACT CCAAATGACG AGCCCTTGGA TATGAAGGAA 
121 GGT GCAGGAT TAACCATACG TAAAGTAAAT CCTGTAGAAG T GACAACT AC GGCTCGCCTG 
181 GCACCTGAGC TTTATTAA 
SEQ ID 60 

MTYALQVEHLTIAHLIQGFNYKTPNDEPLDMKEGAGLTIRKVNPVEVTTTARIiAPELY 
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SEQ ID 61 D34-52 

1 ATGACTTAT G CAT T AC AAG TGGAACACCT AACAATAGCA 
61 CATTTGATCC AGGGTTTCAA TTACAAAACT CCAAATGACG AGCCCTTGGA TATGAAGGAA 
121 GGT GCAGGAT TAACTATACG TAAAGTAAAT CCTGTAGAAG TGACAATTAC GGCTCGCCTG 
181 GCACCTGAGC TTTATTAA 
SEQ ID 62 

MTYALQVEHLTIAHLIQGFNYKTPNDEPLDMKEGAGLTIRKVNPVEVTITARLAPELY 



FIG. 32 

SEQ ID 63 D34-25 

1 ATGACTTAT G CAT T AC AAG TGGAACACCT AACAATAGCA 
61 CATTTGATCC AGGGTTTCAA TTACAAAACT CCAAATGACG AGCCCCTGGA TATGAAGGAA 
121 GGT GCAGGAT TAACTATACG TAAAGTAAAT CCTGTAGAAG TGACAATTAC GGCTCGCCTG 
181 GCACCTGAGC TTTATTAA 
SEQ ID 64 

MT YALQVEH LT I AH L I Q GFN YKT P N DE P L DMKE GAGLT I RKVN P VEVT I T ARLAP ELY 



FIG. 33 

SEQ ID 65 D56AD10 

1 TATAGCCTT GGACTTAAGG TTATCCGAGT AACATTAGCC 
61 AACATGTTGC AT G GAT T C AA CTGGAAATTA CCTGAAGGTA T G AAG C C AG A AGATATAAGT 
121 GTGGAAGAAC ATTATGGGCT CACTACACAT CCTAAGTTTC CTGTTCCTGT GATCTTGGAA 
181 TCTAGACTTT CTTCAGATCT CTATTCCCCC ATCACTTAA 
SEQ ID 66 

YSLGLKVIRVTLANMLHGFNWKLPEGMKPEDISVEEHYGLTTHPKFPVPVILESRLSSDLYSPIT 



FIG. 34 

SEQ ID 67 D56-AA11 

1 ATACAGTCTT GGGATTCGTA TAATTAGGGC AACTTTAGCT 
61 AACTTGTTGC AT GGATT CAA CTGGAGATTG CCTAATGGTA TGAGTCCAGA AGACATTAGC 
121 AT G GAAGAGA TTTAT GGGCT AATTACACAC CCCAAAGTCG CACTTGACGT GAT GAT G GAG 
181 CCTCGACTTC CCAACCATCT TTACAAATAG 

SEQ ID 68 

YSLGIRIIRATIANLLHGFNWRLPNGMSPEDISMEEIYGLITHPKVALDVMMEPRLPNHLYK 
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SEQ ID 69 D177-BD5 

1 ATTAATTTTT CAATACCACT TGTTGAGCTT 

121 GCACTTGCTA AT CTATT GTT T CAT T AT AAT TGGTCACTTC CTGAAGGGAT GCTAGCTAAG 

181 GAT GT T GAT A TGGAAGAAGC TTTGGGGATT AC CAT G C AC A AGAAATCTCC CCTTTGCTTA 

241 GTAGCTTCTC ATTATACTTG TTGA 
SEQ ID 70 

INFSIPWELALANLLFHYNWSLPEGMLAKDVDMEEALGITMHKKSPLCLVASHYTC 
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SEQ ID 71 D56A-AG10 

1 ATGCAACTTG GGCTTTATGC AT T GGAAAT G GCTGTGGCCC ATCTTCTTCA TTGTTTTACT 
61 TGGGAATTGC CAGAT GGTAT GAAACCAAGT GAGCTTAAAA T G GAT GAT AT TTTT GGA.CTC 
121 ACTGCTCCAA AAGCTAATCG ACTCGTGGCT GTGCCTACTC CACGTTTGTT GTGTCCCCTT 
1.81 TATTAATTGA 
SEQ ID 72 

MQLGLYALEMAVAHLLHCFTWELPDGMKPSELKMDDIFGLTAPKANRLVAVPTPRLLCPLY 



FIG. 37 

SEQ ID 73 58-BC5 

1 ATGCAACTT GGGCTTTATG CAT T AGAAAT GGCAGTGGCC 
61 CATCTTCTTC TTTGCTTTAC TTGGGAATTG C CAGAT GGTA TGAAACCAAG T GAG CT T AAA 
121 AT GGAT GATA TTTTTGGACT CACTGCTCCA AGAGCTAATC GACTCGTGGC TGTGCCTAGT 
181 CCACGTTTGT TGTGCCCACT TTATTAA 
SEQ ID 74 

MQLGLYALEMAVAHLLLCFTWELPDGMKPSELKMDDIFGLTAPRANRLVAVPSPRLLCPLY 



FIG. 38 

SEQ ID 75 D58-AD12 

1 ATGCAACTT GGGCTTTATG CAT T GGAAAT GGCTGTGGCC 
61 CATCTTCTTC ATTGTTTTAC TTGGGAATTG C CAGAT G GT A TGAAACCAAG TGAGCTTAAA 
121 AT GGAT GATA TTTTTGGACT CACTGCTCCA AGAGCTAATC GACTCGTGGC TGTGCCTACT 
181 CCACGTTTGT TGTGTCCCCT TTATTAA 
SEQ ID 76 

MQLGL YALEMAVAHLLHC FTWEL P DGMKP S ELKMDDI FGLTAP RANRLVAVPT P RLLCP L Y 
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SEQ ID 77 D5 6-AC11 

1 ATGCTTTGG AGTGCGAGTA TAGTGCGCGT CAGCTACCTA 
61 ACTTGTATTT ATAGATTCCA AGTATATGCT GGGTCTGTGT TCAGAGTAGC ATGA 
SEQ ID 78 

MLWSAS I VRVS YLT C I Y RFQVYAG S VFRVA 



FIG. 40 

SEQ ID 79 D35-39 

1 ATGCTTTGG AGTGCGAGTA TAGTGCGCGT CAGCTACCTA 
61 ACTTGTATTT ATAGATTCCA AGTATATGCT GGGTCTGTGT TCAGAGTAGC ATGA 
SEQ ID 80 

MLWSAS IVRVS YLT CI YRFQVYAGS VFRVA 
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SEQ ID 81 D58-BH4 

1 ATGCTTTGG AGTGCGAGTA TAGTGCGCGT CAGCTACCTA 
61 ACCTGTATTT ATAGATTCCA AGTATATGCT GGGTCTGTGT TCAGAGTAGC ATGA. 
SEQ ID 82 

MLWS AS IVRVS YLT C I YRFQVYAG S VFRVA 
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SEQ ID 83 D177-BD7 

1 AT T AAT TT T T CAATACCACT TGTTGAGCTT GCACTTGCTA AT CT AT T GT T T CAT T AT AAT 

61 TGGTCACTTC CTGAGGGGAT GCTACCTAAG GAT GT T GAT A TGGAAGAAGC TTTGGGGATT 

121 AC CAT G C AC A AGAAATCTCC CCTTTGCTTA GTAGCTTCTC ATTATAACTT GTTGTGA 
SEQ ID 84 

INFSIPLVELALANLLFHYNWSLPEGMLPKDVDMEEALGITMHKKSPLCLVASHYNLL 



FIG. 43 
SEQ ID 85 

D17 6-BF2 

1 AT ATCATTTGGT TTGGCTAATG TTTATTTGCC ACTAGCTCAA 
121 T T GT TAT AT C ATTTTGATTG GAAACTCCCT ACTGGAATCA ATTCAAGTGA CTTGGACATG 
181 ACT GAGT CGT CAGGAGTAAC TTGTGCTAGA AAGAGT GAT T TAT ACT T GAC TGCTACT CCA 
241 TAT CAACTT T CTCAAGAGTG A 
SEQ ID 86 

GISFGLANVYLPLAQLLYHFDWKLPTGINSSDLDMTESSGVTCARKSDLYLTATPYQLSQE 
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SEQ ID 87 D56-AD6 

1 ATGCTTTGG AGTGCGAGTA TAGTGCGCGT CAGCTACCTA 
61 ACTTGTATTT ATAGATTCCA AGTATATGCT GGGTCTGTGT C CAGAGT AG C ATGA 
SEQ ID 88 

MLWS AS IVRVS YLT CI YRFQVYAGS VS RVA 



FIG. 45 

SEQ ID 89 D73A-AD6 

1 CT GAATTTTGCA AT GTTAGAGG CAAAAATGGC ACTTGCATTG 
121 ATTCTACAAC ACTATGCTTT TGAGCTCTCT CCATCTTATG CACATGCTCC T CAT AC AAT T 
181 ATCACTCTGC AACCTCAACA TGGTGCTCCT TTGATTTTGC GCAAGCT GTA G 
SEQ ID 90 

LNFAMLEAKMALALILQHYAFELSPSYAHAPHTIITLQPQHGAPLILRKL 
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SEQ ID 91 D70A-BA11 

1 CT GAATTTTGCA ATGTTAGAGG CAAAAATGGC ACTTGCATTG 
121 ATTCTACAAC ACTATGCTTT TGAGCTCTCT C C AT CT TAT G CACACGCTCC T CAT ACAATT 
181 ATCACTCTGC AACCTCAACA TGGTGCTCCT TTGATTTTGC GCAAGCTGTA G 
SEQ ID 92 

LNFAMLEAJKMALALILQHYAFELSPSYAHAPHTIITLQPQHGAPLILRKL 



FIG. 47 • 

SEQ ID 93 D70A-BB5 

1 AA TAATTTTGCA ATGTTGGAAA. CTAAGATTGC CTTAGCAATG 
121 AT C CT AC AG C GTTTTGCTTT CGAGCTTTCT C CAT C TT AC G CTCATGCACC TACTTATGTC 
181 GTCACTCTTC GACCTCAGTG TGGTGCTCAC TTAATCTTGC AAAAATTATA GGTCCTTAAT 
241 CTGGATTTCC CAT TAT T GAG TAGTGCCTAA TAAATCTTCT CTAT CACTAT TTTTCCATCT 
301 TTCA 
SEQ ID 94 

NN FAMLETKIALAMI LQRFAFELS P S YAHAPT YWTLRPQCGAHLI LQKL 



FIG. 48 

SEQ ID 95 D70A-AB5 

1 AGCGAAGGGG TGGCAAAGGC AACAAAGGGG AAAAT G AC AT ATTTTCCATT TGGTGCAGGA 
61 CCGCGAAAAT GCATTGGGCA AAACTTCGCG ATTTTGGAAG CAAAAATGGC TAT AGCT AT G 
121 ATTCTACAAC GCTTCTCCTT CGAGCTCTCC CCAT CTTATA CACACTCTCC ATACACTGTG 
181 GTCACTTTGA AACCCAAATA TGGTGCTCCC CTAATAATGC ACAGGCTGTA GT C CT GT GAG 
241 AATAT G CT AT CCGAGGAATT CAGTTCCT 
SEQ ID 96 

QN FAI liEAKMAI AMI LQRFSFSLSPS YT K S P YT WT L K P K YGAP L I MH PvL 



FIG. 49 

SEQ ID 97 D70A-AA8 

1 AGCGAAGGGG TGGCAAAGGC AACAAAGGGG AAAAT G AC AT ATTTTCCATT TGGTGCAGGA 
61 CCGCGAAAAT GCATTGGGCA AAACTTCGCG ATTTTGGAAG CAAAAATGGC TATAGCTATG 
121 ATTCTACAAC GCTTCTCCTT CGAGCTCTCT CCAT CT TATA CACACTCTCC ATACACTGTG 
181 GTCACTTTGA AACCCAAATA TGGTGCTCCC CTAATAATGC ACAGGCTGTA GTCCTGT 
SEQ ID 98 

QNFAILEAKMAIAMILQRFSFELSPSYTHSPYTWTLKPKYGAPLIMHRL 



FIG. 50 

SEQ ID 99 D70A-AB8 

1 C AAAATTTTGC CAT GT T AGAA GCAAAGATGG CTCTGTCTAT GATCCTGCAA 
121 CGCTTCTCTT TTGAACTGTC TCCGTCTTAT GCACATGCCC CTCAGTCCAT ATTAACCGT 
181 CAGCCACAAT ATGGTGCTCC ACTTATTTTC CACAAGCTAT AA 
SEQ ID 100 

QNFAMLEAKMALSMILQRFSFELSPSYAHAPQSILTVQPQYGAPLIFHKL 
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SEQ ID 101 D70A-BH2 

1 AT AAACTTTGCA ATGACAGAAG CGAAGATGGC TATGGCTATG 
121 ATTCTGCAAC GCTTCTCCTT TGAGCTATCT CCAT CTTACA CACATGCTCC ACAGTCT GTA 
181 ATAACTATGC AACCCCAATA TGGTGCTCCT CT T AT AT T G C ACAAATT GTA A 
SEQ ID 102 

INFAMTEAKMAMAMILQRFSFELSPSYTHAPQSVITMQPQYGAPLILHKL 



FIG. 52 

SEQ ID 103 D70A-AA4 

1 AT AAACTTTGCA ATGGCAGAAG C GAAGAT GGC TATGGCTATG 
121 ATTCTGCAAC GCTTCTCCTT TGAGCTATCT CCAT CTTACA CACATGCTCC ACAGTCT GTA 
181 ATAACTATGC AACCCCAATA TGGTGCTCCT CTTATATTGC ACAAATT GTA A 
SEQ ID 104 

INFAMAEAKMAMAMILQRFSFELSPSYTHAPQSVITMQPQYGAPLILHKL 



FIG. 53 

SEQ ID 105 D70A-BA1 

1 CA AAACTTTGCA AT GAT GGAAG C AAAAAT GGC AGTAGCTATG 
121 ATACTACAAA AATTTTCCTT TGAACTATCC CCTT CTTATA CACATGCTCC ATTTGCAATT 
181 GTGACTATTC ATCCTCAGTA TGGTGCTCCT CTGCTTATGC GCAGACTTTA A 
SEQ ID 106 

QNFAMMEAKMAVAMILQKFSFELSPSYTHAPFAIVTIHPQYGAPLLMRRL 



FIG, 54 

SEQ ID 107 D70A-BA9 

1 CA AAACTTTGCA ATGATGGAAG C AAAAAT GGC AGTAGCTATG 
121 ATACTACATA AATTTTCCTT TGAACTATCC CCTT CTTATA CACATGCTCC ATTTGCAATT 
181 GTGACTATTC ATCCTCAGTA TGGTGCTCCT CTGCTTATGC GCAGACTTTA A 
SEQ ID 108 

QNFAMMEAKMAVAMILHKFSFELPSYTHAPFAIVTIHPQYGAPLLMRRL 



FIG. 55 

SEQ ID 10 9 D70A-BD4 

1 CA AAATTTTGCT AT GTT AGAGG CTAAAATGGC AAT GGCTAT G 
121 ATTCTGAAAA CCTATGCATT TGAACTCTCT C CAT CT T AT G CTCATGCTCC TCATCCACTA 
181 CTACTTCAAC CT C AAT AT G G TGCTCAATTA ATTTTGTACA AGTTGTAG 
SEQ ID 110 

QNFAMLEAKMAMAMILKT YAFELS P S YAHAPHP LLLQPQYGAQLI LYKL 



WO 03/078577 



12/31 



PCT/US03/07430 



FIG. 56 

SEQ ID 111 D181-AC5 

1 TAT AG CAT G G GGCTCAAGGC GAT T CAAGCT AGCTTAGCTA 
61 ATCTTCTACA TGGATTTAAC TGGTCATTGC CT GATAAT AT GACTCCTGAG GACCTCAACA 
121 TGGATGAGAT TTTTGGGCTC T CT AC AC CT A AAAAATTTCC ACTTGCTACT GT GAT T GAG C 
181 CAAGACTTTC AC C AAAACT T TACTCTGTTT GA 
SEQ ID 112 

YSMGLKAI QASLANLLHGFNWSLPDNMTPEDLNMDEIFGLSTPKKFPLATVI EPRLSPKLYSV 
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SEQ ID 113 D144-AH1 

1 TAT AGCTTGGGGC TCAAGGAGAT TCAAGCTAGC 
61 TTAGCTAATC TTCTACATGG ATTTAACTGG TCATTGCCTG ATAATATGAC TCCTGAGGAC 
121 CTCAACATGG ATGAGATTTT TGGGCTCTCT ACACCTAAAA AATTTCCACT TGCTACT GTG 
181 AT T GAG C C AA GACTTTCACC AAAACTTTAC TCTGTTTGA 
SEQ ID 114 

YSLGLKEIQASLANLLHGFNWSLPDNMTPEDLNMDEIFGLSTPKKFPLATVI EPRLSPKLYSV 
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SEQ ID 115 D34-65 

1 CATAGCTTG GGGCTCAAGG TGATTCAAGC TAGCTTAGCT 
61 AATCTTCTAC ATGGATTTAA CTGGTCATTG CCTGATAATA TGACTCCTGA GGACCT CAAC 
121 ATGGATGAGA TTTTTGGGCT CTCTACACCT AAAAAATTTC CACTTGCTAC T GT GAT T GAG 
181 CCAAGACTTT CACCAAAACT TTACTCTGTT TGA 
SEQ ID 116 

HSLGLKVIQASLANLLHGFNWSLPDNMTPEDLNMDEIFGLSTPKKFPLATVI EPRLSPKLYSV 
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SEQ ID 117 D35-BG2 

1 CTGTGCTTT C CAT GT T T AA TCTCTAGTTA TATACTGGCT 
61 TTGAATGTGA ATCTGTATCA TAATTTCTTG CAAATTTCTC CTTCCATTTC TTATTAA 
SEQ ID 118 

LCFPCLISSYILALNVNLYHNFLQISPSISY 



FIG. 60 

SEQ ID 119 D73A-AH7 

1 TCTG GACTTGCTCA ATGTGTGGTT GGTTTAGCTT TAGCAACTCT AGTGCAGTGT 
121 TTTGAGTGGA AAAGGGTAAG CGAAGAGGTG GTTGATTTGA CGGAAGGAAA AGGTCTCACT 
181 ATGCCAAAAC CCGAGCCACT CAT GG CTAG G TGCGAAGCTC GTGACATTTT TCACAAAGTT 
241 CTTTCAGAAA TATCTTAA 
SEQ ID 120 

SGLAQCWGLALATLVQCFEWKRVSEEWDLTEGKGLTMPKPEPLMARCEARDIFHKVLSEIS 
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SEQ ID 121 D58-AA1 

1 TTGGGCTTG GCAACGGTGC ATGTGAATTT GATGTTGGCC 

61 CGAATGATTC AAGAATTTGA ATGGTCCGCT TACCCGGAAA AT AG GAAAGT G GAT T T TACT 

121 GAGAAATTGG AATTTACTGT GGTGATGAAA AATCCTTTAA GAGCTAAGGT CAAGCCAAGA 

181 ATGCAAGTGG TGTAA 
SEQ ID 122 

LGIATVHVNLMLARMIQEFEWSAYPENRKVDFTEKL^ 
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SEQ ID 123 D73A-AE10 

1 TATGCTT TGGCTATGCT TCATTTAGAG 

121 TACTTTGTGG CTAATTTGGT TTGGCATTTT CGATGGGAGG CTGT GGAGGG AGATGAT GTT 
181 GATCTTTCAG AAAAGCTAGA ATTCACCGTT GTGATGAAGA ATCCACTTCG AGCTCGTATC 
241 TGCCCCAGAG TTAACTCTAT TTGA 
SEQ ID 12 4 

YALAMLH LE Y FVAN LVWH FRWEAVE GD D VD L S E KL E FTWMKN P LRARI C P RVN S I 
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SEQ ID 125 D56A-AC12 

1 GGTCAGCAAG TTGGACTTCT TAGAACAACC ATTTTCATCG C CT CAT TACT GTCTGAATAT 
61 AAGCTGAAAC CTCGCTCACA CCAGAAACAA GTTGAACTCA CCGATTTAAA TCCAGCAAGT 
121 TGGCTTCATT CGATAAAAGG CGAACTGTTA GTCGATGCGA TTCCTCGAAA GAAGGCGGCA 
181 TTTTAA 
SEQ ID 12 6 

GQQVGLLRT T I FI AS LLS EYKLKP RS HQKQVELT D LN PAS WLHS I KGELLVDAI P RKKAAF 
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SEQ ID 127 D177-BF7 

1 AT C AC AT T T G CTAAGTTTGT GAATGAGCTA 
121 GCATTGGCAA GATTAATGTT CCATTTTGAT TTCTCGCTAC CAAAAGGAGT TAAGCAT GAG 
181 GATTTGGACG TGGAGGAAGC TGCTGGAATT ACT GT T AGAA GGAAGTTCCC CCTTTTAGCC 
241 GTCGCCACTC CATGCTCGTG A 
SEQ ID 128 

ITFAKFVNELAIARLMFHFDFSLPKGVKHEDLDVEEAAGITVRRKFPLLAVATPCS 



FIG . 65 

SEQ ID 129 D73A-AG3 

1 CA GAGGTATGCT ATAAACCATT TGATGCTCTT TATTGCGTTG 

121 TTCACGGCTC T GATT GATTT CAAGAGGCAC AAAACGGACG GCTGTGATGA TATCGCGTAT 

181 ATTCCAACCA TTGCTCCAAA GGATGATTGT AAAGTGTTCC TTTCACAGAG GTGCACTCGA 

241 TTCCCATCTT TTTCATGA 
SEQ ID 130 

QRYAINHLMLFIALFTALIDFKRHKTDGCDDIAYIPTIAPKDDCKVFLSQRCTRFPSFS 
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SEQ ID 131 D70A-AA12 

1 ATG TCATTTGGTT TAGCTAAT CT TTACTTACCA TTGGCTCAAT 

121 TACTCTATCA CTTTGACTGG AAACTCCCAA CCGGAATCAA GCCAAGAGAC TTGGACTTGA 

181 C C GAATT AT C GGGAATAACT ATTGCTAGAA AGGGTGACCT TTACTTAAAT GCTACTCCTT 

241 ATCAACCTTC TCGAGAGTAA 
SEQ ID 132 

MSFGLANLYLPLAQLLYHFDWKLPTGIKPRDLDLTELSGITIARKGDLYLNATPYQPSRE 
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SEQ ID 133 D185-BC1 

1 TTGGGCTTG GCAACGGTGC AT GT GAATT T GATGTTGGCC 
61 CGAACGATTC AAGAATTTGA ATGGTCCGCT TACCCGGAAA ATAGGAAAGT GGAT T T t ACT 
121 GAGAAATTGG AATTTACTGT GGTGATGAAA AACCCTTTAA GAGCTAAGGT CAAGCCAAGA 
181 ATGCAAGTGG TGTAA 
SEQ ID 134 

LGLAT VHVNLMLART I QE FEW SAYPEN RKVD FTEKLE FTWMKN P LRAKVK P RMQW 
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SEQ ID 135 D185-BG2 

1 TTGGGCTTG GCAACGGTGC ATGTGAATTT GATGTTGGCC 

61 CGAATGATTC AAGAATTTGA ATGGTCCGCT TACCCGGAAA ATAGGAAAGT G GAT T TACT G 
121 AGAAATTGGA ATTTACTGTG GTGA 
SEQ ID 13 6 

LGIATVHVNLMLAPMIQEFEW SAYPEN RKVDLLRNWNLLW 
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SEQ ID 137 D185-BE1 

1 ATCACATTT GCTAAGTTTG TGAATGAGCT AGCATTGGCA 
61 AGATTAAT GT TCCATTTTGA TTTCTCGCTA CCAAAAGGAG TTAAGCATGA GGATTTGGAC 
121 GTGGAGGAAG CTGCTGGAAT TACT GTTAGG AGGAAGTTCC CCCTTTTAGC CGTCGCCACT 
181 CCATGCTCGT GA 
SEQ ID 138 

ITFAKFVNEIiALARLMFHFDFSLPKGVKHEDLDVEEAAGITVRRKFPLLAVATPCS 



FIG. 7 0 

SEQ ID 139 D185-BD2 

1 ATCACATTT GCTAAGTTTG TGAATGAGCT AGCATTGGCA 

61 AGATTAAT GT TCCATTTTGA TTTCTCGCTA CCAAAAGGAG TTAAGCATGC GGATTTGGAC 

121 GTGGAGGAAG CTGCTGGAAT TACTGTTAGA AGGAAGTTCC CCCTTTTAGC CGTCGCCACT 

181 CCATGCTCGT GA 
SEQ ID 140 

ITFAKFVNELALARLMFHFDFSLPKGVKHADLDVEEAAGITVRRKFPLIAVATPCS 
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SEQ ID 141 D176-BG2 

1 CA AAATTTTGCC ATGTTAGAAG CAAAGACTAC TTTGGCTATG 
121 ATCCTACAAC GCTTCTCCTT TGAACTGTCT CCATCTTATG CACATGCTCC T C AGT C CAT A 
181 ATAACTTTGC AACCCCAGTA TGGTGCTCCA CTTATTTTGC ATAAAATATA G 
SEQ ID 142 

QNFAMLEAKTTLAMILQRFSFELSPSYAHAPQSIITLQPQYGAPLILHKI 
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SEQ ID 143 D185-BD3 

1 ATTATCCTT GCACTGCCAA TTCTTGGCAT TACCTT GGGA 
61 CGCTTGGTGC AGAACTTTGA GTTGTTGCCT CCTCCAGGAC AGTCAAAGCT TGACACAACA 
121 GAGAAAGGCG GGCAATTCAG TCTGCACATT TTGAAGCATT CCACCATTGT GATGAAACCA 
181 AGATCTTTTT AA 
SEQ ID 144 

IILALPILGITLGRLVQNFELLPPPGQSKLDTTEKGGQFSLHILKHSTIVMKPRSF 



FIG. 73 

SEQ ID 145 D176-BC3 

1 C AAAATTTTGC CAT GT T AGAA GCAAAGACTA CTTTGGCTAT 

121 GAT C CT AC AA CGCTTCTCCT TTGAACTGTC TCCATCTTAT GCACATGCTC CT C AGT C CAT 

181 AATAACTT GC AACCCCAGTA TGGTGCTCCA CTTATTTTGC ATAAAATATA GTTTATTACT 

241 T GTAAGT AGT GTCTCGTTTT ATGTTAAGCA TGAGTCCAAA ATGTTAAGGC TTGTAGAACT 

3 01 GCAAAATGGG AATGCATTTG CACTCGTGCA CTGTAGATTG TTGTAA 
SEQ ID 146 

QNFAMLEAKTTLAMILQRFS FELSPSYAHAPQSIITCNPSMVLHLFCIKYSLLLVSSVSFYVKHESKMLRLVELQNGNA 
FALVKCRLL 



FIG. 7 4 

SEQ ID 147 D176-BB3 

1 GCTGAT 

61 ATGGGGTTGC GAGCAGTTTC TTTGGCATTA GGT GCACTTA TTCAATGCTT TGACTGGCAA 

121 AT T GAGGAAG CGGAAAGCTT G GAG GAAAG C TATAATTCTA GAAT GACTAT GCAGAACAAG 

181 CCTTTGAAGG TTGTCTGCAC TCCACGCGAA GATCTTGGCC AGCTTCTATC CCAACTCTAA 
SEQ ID 148 

ADMGLRAVSLALGALIQCFDWQIEEAESLEESYNSRMTMQNKPLKWCTPREDLGQLLSQL 
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Figure : Amino Acid Identity of Group Members 



Group 1 

. AQLi^NLVTSMIiGKLLKKFTWAPAPGVNPEB IDLEESFGTVTYWKNPIQAI FTPRLPAHLYGRVPVDM 

I 

AQIiAINXiVT SM1GHLI*HHFTWAP P P GVNP2 NT DLE ESP GTVTYMKNP I QAI PTPRLPAHLYGRVPVDM 



SEQ XD Wo, :2 D58-JBG7 
(99-5) 

SEQ ID No.:4 D5B- ABl 



Group 2 

QLATNLVTS ML GHLiFI I IjHGLRPS.GIjTR^I L TWRKAIjE Q 



SEQ 1X> No'. :8 D5B-BE4 



Group 3 

EGLAVRMVALSI^GCI 2 QCFDWQRI GESl.VDMTEGTGLTXjPiCAQPLV^KCS PRPXiMANLLSQI 

I I I I 

EGLAIRMVALSXGCI IQCPDWQRLGESLVDKTSGTGIjT^ 



SEQ ID NO..-10 P5S-JUi7 
(33. 5) 

S£Q ID . NO - :12- D13a-5 



Group 4 

IGFATLVTHIjTFGRIMGFDFSKPSN^ 

l I 11 

IOTATLVTHLTFGRIJ-QGFDFSTPSNT^^ 



SEQ IP No. =14 D5G-AG10 
(53,3) 

SEQ ID NO. =18 D34-62 



Group 5 

IIIAIjPI^GITiGRLVQN^ 

• 1 

III^PILGITXjGHIjVQNFEtiPPPGOSKLiDTrEKGGQFSIJIIIiKHSTIVllKPIiS? 

II 

XIIiAIiPILGrTLGJO^VQIs^ 



• SEQ ID *To,-2Q D5 6-AA? 

(99.2) 

SSQ ID No. :144 D1S£-BD3 
(96-4), 

SEQ ID No- =22 D5 6-A51 



Group 6 

' IALGVASMEI^XSl^LXM^WBLPFGMKKEDI 

II- I I 

I I 

1 1 ( 

IAIjGVASMELMjCNIxL^ 



SEQ ID UO-:24 D3 5-BB7 
(52. a) 

SEQ ID No;s26 D177-BA7 
(96.4) 

SEQ ID Ko.;28 D5SA- AB£ 
(24-6) 

SEQ ID No.:3Q D144-AE2 



Group 7 

XSFGDANAYLPIAQIiIiYHFDW^ 

III 1 

iSFGIiANAYIiP^QI^^^ 



SEQ ID No. *32 D5S-AG11 
(93.3) 

SEQ ID No.:34 D17S-AA1 



Group 8 

^FGLANVGQPIAQIiLYHFira 

II i ll 



SEQ ID No.s36 D56-AC7 
(SI. 2) 

SEQ ID tfo. :38 D144-AD1 



Group S 

IJ^FGnVNVGHPIAQI^YHFDP^ 



SEQ ID No- .-40 I3Z44-AB5 
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Figure : Amino Acid Identity of Group Members 



Group 10 

! I I i I I 



SEQ ID NO. s 42 D131-AB5 
(99.8) 

SEQ ID No.:44 D73-Ac9 



Group 11 

MQFGIoALVTLPIAHLL^ SARRSKDLYLI ATPYVSPLD 



SEQ ID No. D5S-AC12 



Group 12 

mYALQVEKLTWAKLIQGFNYXTPTDEPIiDM 

III II I 

MTyALQVEHLTMAHLI QGFNyXTPNDEALDMKEGAGI TIRJCVNPVEIjI ITPRIiAPETY 

II I 
MTY&LQVEHLTM&HIjI QGFNYRTPNDE5LDMKEG&GX TIRICVNPVELI I APRI^APELY 

■ MTYALQVEHLTM?^HIjI QGFNYRTPtfDE PIDMKEGAGI T IRKVUPVELI IAP - LAPELY 

' MTyALQVEHI/EMAm^ 

I | 1 MUM 

' MTYAI^ VK^TXAHXjI Q GFNYKT ?ND EPIDKK3 G AGIkTI RKVNP VEVTTT ARIAP EIj Y 

I 

^ MTYALQVEHX/TIAI^IQGFi>TYCT 



SEQ ID No.: 48 D5B-A32 
(89.6) 

SEQ ID No.:50 D5S-AG9 
(98.2) 

' S£Q ID No. ,-52 D56-AG£ 
(94. B) 

SEQ ID No- =54 D35-BG11 
(98.3) 

SEQ ID No. iS6 D35- 42 
(98.3) 

SEQ ID No.:58 D35-BA3 
(84. S) 

SEQ D34-57 
(58.3) 

SEQ ID No. :62 D3 4-52 



Group 13 . 

YSLGLKVIRVTLANML^^ 



SEQ ID No. :65 D55-ADI0 



Group 14 

YSLGIRIIRATIJUSrjI^GFNW^ 



SEQ ID No . =66 D5&-AA11 



Group 15 

XNFSI PLVELALJUSJI^FHYNWSL^ 
IOTSIPLV^IJO^ANIiLFHYlSlWSLPEGMI^ 



SEQ ID No. =70 D177-BDS 
(94.7] 

SEQ ID No. t 84 D177-BD7 



Group 1 6 

MQIiGLYAIjE^V^^IaljIiCFTWE^ PD GMKPS ELKMDD X FGLTAPKANRL VAVP SPRLi C PIj Y 

" 1 I 

MQLGLYAliE^VAHIaljHCFTWEIjPDGMKP 

MQLGLYALE^VAm^ 



SEQ ID NO. = 74 D5e-BC5 
(56. 7) 

SEQ ID STo.i76 DSS~AD12 
(98. 4) 

SEQ ID No. j 72 D56A-AG10 



Group 17 

MLWS ASI VfcVS YI/TCI YRFQVTAGSVFRVA 

I 

MLWSASIVRVSYLTCIYRFQVYAGSV5RVA 



SEQ ID No.:7fi D56-AC11 
(95.7) 

SEQ ID No.:B8 D55- AD6F 
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Figure : Amino Acid Identity of Group Members 



Group 18 

IiNFAMLSAKM^LAL I XjQHYAFEL £ P S YAHAPETI I XLQ P QHGAFL I KRKL 

I Mill 1 1 1 1 i ! I 

NNFAI^ETKIAIAMILQ3^AFELSPSYAEA 



SEQ ID NO - :90 D73A-AD6 
(74-0) 

SEQ ID No. :94 D7 0A-BB5 



Group 19 

QNFAILEAKMAIAMI LQRFS FELSPSYTHS P^TTWTL2C?KYCAPIjIHfi£RIj 

| |1 | I I I I I i I 1 II 

QNFi^EAKKALSMILQRFSFE 

It 1 I I i 

QKT?i^EAKTTX^MILQRFSFEL5PSYAHAPQSI ITLQPQYGAPLILHKI 

MFAmteak^ 

INPAwLlAKKkl^ 

I .] I ! 1 .111111 llll • 

QNFAMMEAKMAVAMI;^ 



SEQ ID NO- iBS 

SEQ ID NO. : 100 

SEQ ID No. : 142 

SEQ ID No. ;102 

SEQ ID No.: 104 

SEQ ID NO. : 10 3 

SEQ ID Mo.:106 



D70A-AB5 

(72.0) 
D70A-AB8 

(88.0) 
D176-BG2 

(86. 0) 
D7 0A-BH2 

(98.0) 
D7 OA-AA4 

(70. 0) 
D70A-BA$ 

(5a. o) 

E>7 OA-BA1 



Group 20 

QftFAMLJCAKMAMAMI LKTYAFSLS PSYjVHAPKPLIjL - QPQYGAQLILYKL 



SEQ ID No. i 110 D70A-BD4 



Group 21 

YSMGLXAIQASIANI^^ 

YsicLKBIQASIAI^LHGFNWSLPDI^ FGLSTPKKFPI^TVIEPRDSPKLYS V 

AsLGI^IQ^ 



SEQ ID No-: 112 D181-AC5 
(96.3) 

SEQ ID NO.: 114 D 144 -AMI 
(S6. 8) 

SEQ ID NO. r 11^ 



Group 22 

DCFPCX.I S S YI LALNVNLYHNFIaQI SP S I S Y 



SEQ ID NO-:118 D35-BG2 



Group 2 3 

SGLAQCWGI^^ 



SEQ ID NO.: 120 D72A-AH7 



Group 24 

IoGIAFVHVNLMIjA^ ' 



SEQ ID NO.: 136 D185-BG2 
(77.5) 

• SEQ ID No. zl22 D58-AA1 
(98-2) 

SEQ ID No.: 134 DI8S-BC1 
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Figure : Ajcoino Acid Identity of Group Members 

Group 25 

YAIiAMLHLiEYF^^ I SEQ IP JJo . : 124 D73A-AE10 

Group 26 

QQVGlJjRTTIFIASIiXiSEyKL^ P3.2GCAAF SEQ ID No.: 126 D5SA-AC12 

Group 27 

:• 1 

ITFAKF Vb^IAXJ^iMFHFD FSLPKGVKHADLDVE J2AAGI TVRZKFPLIAVATP CS 



SSQ ID No.; 128 D177-BF7 
(98-2J 

- SEQ ID No. s 140 D1B5-BD2. 



Group 28. 

QRYAINHD^FIALFTALIDFKRHKTO 



SEQ ID No . . 13 0 D73A-AG3 



Group 29 

FGL&NlAfLPIAQI»IiYHFDWKIjPT^ SRE 

I • III I I I I I I II 

.ISFGLANVTLPIAQIira^ 



SEQ ' ID No.: 13 2 E70A-AA12 

(80. 0) 

SEQ ID NO- :186 17S-BF2 



Group 3 0 SEQ ID No. D17S- BC3 



Group 31 

ADMGLRAVSLALGAIXQCFDWQI^ 



SEQ ID No.:148 D176-BB3 



Group 32 

MOTSLQVEHLSIAKMI QGFSFATTTNEPiDMKQGVGLTLPKXTDVEVLITPRIjPiPTLYQY 



SSQ ID No. : 6 I>18 6-AH4 



The percentage identity between most related pairs is noted in (0.0%). Each group had at 
least 70% identity to another group member. Group 19 contained the lowest percentage 
identity at 70-0%. 
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P'/G. A/a H ; C 



Group 1 

D58-BG7 
D53-BE4 

D35-3BF 



pA(t( r^w of y&due^ct? 



GCACAACTTGCTAT CAACTTGGTCACATCTATGTTGGGTC^TTTGTTG CAT CATTTTACA 
GCACAACTTGCTATC^CrrGGTCACATCTATGTTGGGTC^a?a7TGrr- CATCATTTTACA ' 
GCACAACTTGCTAT CAACTTGGT CACAT CTATGTTGGGT CATT TGTT GCAT CATT TT ACG 
GCACAACTTGCTATCAACTTGGTCACATCTATGTTGG^ 



D58-BG7 
D5B-BE4 
D5B-AB1 
D35-38F 



TGGGCTCCGGCCCCGGGGGTTAACCCGGAGGATATTGACTTGGAGGAGAGCCCTGGAACA 
TGGGCTCCGGCCCCGGGGG1TAACCCGGAGGATATTGACTTGGAGGAGAGCCCTGGAACA 
TGGGCTCCGCCCCCGGGGGTTAACCCGGAGAATATTGACTTGGAGGAGAGCCCTGGAACA 
TGGGCT CCGCCCCCGGGGGTTAACC CGGAGAAIATTGACTTGGAGGAGAGCC CTGGAACA 



D58-BG7 
D58-BE4 
D5 8-AB1 
D25-3 8F 

D58-BG7 
D5 8-BE4 
DS3-A31 
D35-3SF 



GTAACT TAC ATGAAAAATCCAATAC^JVGCTATTC CAACT C CIAAGATTG C CTGCACACTTG 

imiiiiiiiiiiiiiiiiiiiiiiiimiiiiiiiiiiiiiiii 

GTAACTTACATGA • ' ' j ■' 

GTAACTTAGATGAAAAAT CCAATACAAGC TATTCCT ACT C CAAGAITGC CTGCACACTTG 

GTAAC TTACATGAAAAAT C CAATACAAGCT AT T C C TACT C CAAGATT GCCT GCACAC T TG 
************* 

TATGGACGTGTGCCAGTGGATATGTAA 

IIMINilllllNlNllMliill 

TTTT77TTTTTTT7TTTT7 7TTT77TT 

TATGGACGTGTGCCAGTGGATATGTAA 
TATGGACGTGTGCCAGTGGATATGTAA 



Group 2 

D56-AH7 
Z>13a-5 

D5S-AH7 

D5S-AH7 
Di3a-5 

USS-AH7 . 
DiL3a-S 



GAAGGATTGGCTGTTCGAATGGTTGCCTTGTC^TTGGGATGTATTATTCAATG 

| I ' • I 

GAAG GATT GG C T AT T CGAAT GGTT GCATTGT CAT TGGGATGTAT TATT CAAT GCTTT GAT 
************ ************* ************************** ****** 

TGGCAACGAATCGGCGAAGAATTGGTTGATATGACTGAAGGAACTGGACTTACTTTGCC 

tggcaacgaci4ggggaaggact 



t..****** *-** **** *********** 



************* ******** ******* 



a^gctcaacctttggtggcc^gtgtagcc^cgacctaa^tggctaatcttctctct 

I I I 

aaagctcaacctttagtg^caag 

************** ************************* *************** *** 

cagatttga 
cagatttga 

********* 



WO 03/078577 

f". 17 

Group 3 

-D56-AG10 

D35-33 

D34-62 

, P56-AG10 
D3S-33 
D34-62 

D56-AG10 

D35-33 

D34-S2 

B56-AG10 

D35-33 

D34-62 

Group 4 

D56-AA7 
D56-AE1 
P185-BD3 

D56-AA7 
D56-AE1 
D185-BD3 

D5 6-AA7 
ES6-AE1 
D185-BD3 
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ATAGGCTTTGCGACTTTAGTGAC&CATCTGACTTTTGG 
Ml 

ATJU^ATTTTGCGACTTTAGTGAC^CATCT 

**■* ****************************************************** 

Tri&GTA^CCATCZ^CACGCCAA^ 

XTTAGTAAGCCATCAAACACGCCAATTGACATGACAG^GGCGTAGGCGTTACTTTGCCT 

I I 
CTTAGTACGCCATCAAACACGCCAAa^ 

******* ****************** ********************************* 
AAGGTrAi^TC^GTTGAAGTTCTAATTACCCCTCGTTTACCTTCa^ 

II 

AAGGTTAATCAAGTTGAAGTTCTAkTTACe^ 

i I I i M 

AAGGTAA^TCAAGTGGAAGTTCTAATTAGCCCTCGTXTACCITCTAAGCTTTATGTATTC 
^^^^ ******** ************* ************************* *** 

TGA 



TCSA 



ATTATACTTGCAXTOCCAAXrCTTGGCAT 

! 

ATTATACTTGCATTG C CAATTCTTGGCATTACTTT GGGACGTTTGGTTCAGAA.CTT TG AG 

I ! ill 

ATTAT C CT TG CACT G CCAA.T T C T X GG CAT ZAC CTT GGG ACGCTTGGTSCAGAACTTT GAG 
lt ^.^.» w . ««-**** **************** ** ******** ***** ************ 

CTGTTGCCTCCTCC^GGCCAGTCGAAG 
CTGTTGCCTCCTCCAGGCCRGTCGAAGCTCGAC^ 

| III) I I 

TTGTTGCCTCCTCmGGACAGTCAAAGCtrTGACA^ 
**************** ***** ***** ***** *********** ** ** ****** 

CTCCACATTTTGAAGCATTCCACCA^ 

CTCCaJL^TTTGAAGCATTCCACCATTC 

I l I l II 

CTGOACATTTTGA&GCATTCCACCATT^^ 

+ *±**.*** * ********* * ********** **** * * 
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Group 5 

D56A-A3G 
D144-AE2 
D35-BB7 
D177-BA7 

t>56A-ABS 
D144-A32 
D35-BB7 
D177-BA7 

DSSA-ABS 
D144-AE2 
D35-BB7 
D177-BA7 

D56A-ABG 
D144-AS2 
D35-BB7 
D177-BA7 

Group 6 

D56-AG11 
D179-AA1 

D56-AG11 
D179-AA1 

D56-AG11 
D179-AA1 

D5S-AG11 
231.79 -AAL 



ggtatogcacttggggttgcatccatggaacttgct^ 

! \ ! A TTGCACnTGGGGTTGC^TCCAT^^ 

I ) 

- - -AXTGCACTTGGGGTTGCATCCATGGAACTTGCOTTGTCAAATCTTCTTTATGCA.TTT 



***********.**»****** *********** ****-***»**************** 

GATTGGGAGTTGCCTTATGGAGTGAAAAAAGAAGACATCGACACAAACGTTAGGCCTGGA 
GATTGGGaGTTGCCTTATGrGAGTGAAAAAAGAAGACATCGACA 

I i ! I M 

GATTGGGAGTTACCTTTTGGAATGAAA^AAGAAGACATTGACA 

•III' I M 

GATTGGGAGTTACCTTACGGAGTGAAAAAAGAAAAGAT TGA(2ACAAAT GT CAGGCCTGGA 
*********** **** *** **** ******* **** ******** * ********* 

ATTGC catgcacaagaaaaacgaactttgccttgtcccaaaaaa- ttatttataa- 

j^XTGCCATGCACAAGAAAAACGAACTTTGCC^ 

attacc^tgc^taagaa^^cgaactttat - - - - 

ATTAC CAT GCATAAGAAAAAC GAACTT T G C CTT AT C C CTAGAAA - T TAT C T ATAG 

X3P * ******* ***>************* *** **** * *** **** **** 



NIK II !! M 1 1 1 1 1 ! M ! Ml I 

AITGGGACGTGGATCXCATGCTAG 

lllllllllllllllllilllill 



ATTT CGTTTGGT TTAG CTAAT GCTTAT XT G C CATTGGC T CAATTAC XT TAT CAC TT TGAT 

ATTT CGTTTGG CTTAG CTAATG CTTATTTG C CATTGGCTCAAT TAC TATAT CA C TT CGAT 
*.********* *********************************** ******** *** 

TGGGAACTCCCCACTGGAATCAAACCAAGCGACTC^ 

tgg^ctccctLtggaatcg^^ 

*** ******* ******** ************************** *********-*■** 

actgccgctagaaaaagtgacctt^^ 

actgccgctagaaaaagtclacctttactttc 

^,******************************************^************** 

TGA 

TGA 
*** 



24/31 

-7? 



ATGCTATTTC^TTTAGCT^^ 

^gctatttcgtttagctaatgttggac^ 

tggaaactccctaatggac^aagtcatgagaa 
tggaaactccctaatggacaaactcaccl^^ 

********************** *** * ****************************** 

tcjgctaca&gaaaggatgaxcttgtttt^ 

TCTGCTACA&GAAAGGATG^ 

/„*,******************** ***************** * ****** 

CAGTCTATATCATCTATATGTACTCAATAATTGTATGC3GA. 

lllllIilllllilllllllilllllllllMMIIilll 



•agx^aacacagggcatcctttagcccagttgctctatttcttx 

********** ************* ******* 



ATGTCGTTTGGTTT- 

A**************"******** **** 

TGGAAATTCCCTCATA^GGTTAATGCAGCrGA 

111 II ) 

TGGAAACT GC CTGACAAGGTXAA.TGCAAATGATTTT CGCACTACTGAaACAA.GTAGAGTT 
****** ***** * ************ ******** ********************** 



TTTG 



C^GCAAGaAAAGATGACCTCTACT^ 



TT2GCAGCAAC^J&J3AT^ 



************ ******** 



*************** ********** ********* *** 
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n 1 



Group 14 

D177-BD7 
P177-BD5 

D177-BD7 
' D177-BD5 

D177-BD7 
D177-BD5 



ATTAATTTTTCAATACCACT^ 
AOTAATTTTTCAATACCACTTGTTGM^ 

****** ***************************************** ************* 

TCKTCACTTCCTGAGGGGATGCT 
TGGTCACTTCCTGAAGGGATGCT^ 

,*,*********** ++4.****** **„****************************>*** 
ACCATGCACAAGAAATCTCCCCTTTGCT^ 

AC CAT GGACAAGAAAT CTCCCCTTTGC T TAGTAG CTTCT GATTATA- CTTGT TGA- - 
*********************************************** ******* 



Group 15 

D56A-AG10 

D58-AD12 

,DS8-BC5 

DS6A-AG10 

D58-AD12 

D5S-BC5 

D5SA-AG10 , 
D5 3-AD12 
D58-BC5 

D56A-AG10 

D58-AD12 

D58-BC5 



ATGCAACT^K^CTTTATGCATTGGAAATGGCT^^ 
ATGC AACTTGGGCCTTATGCA^ 

ATGCAACTTGGGCTTTATG cattagaaatggc agtggcccatc ttcttctttg ctttact 
.************* ********** ******** **************** *** ****** 

tgcxsaattgccmatggtatgaam^ 

t gggaat t g c cagat ggtat gaaac caagtgag ct taaaatggatg at at tt t tgga ctc 

' TGGGAATTG C CAGAT GGTATG AAACCAAGTGAGCTTAAAATGGATGATATTTTTGGACTC 
************************************************************* 



ACTGCTO 



CAAAAGCTAATCGACTCGTGGCTGTGCCTACTCCACGTTTGmGTGTCCCCnrT 



ACTGCTCCAAGAGCTAATCGACTCGTGGCTGTGC 
ACTG CTCCAAGAGCTAATCGACTCGTGGCTGTC^ 



********** *********** 

TATTAATTGA 

I I I I 

TATTAA 

III) 

TATTAA 

****** 



»„****-******** *************** ** w** 
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'Ftc ^7 



Group 16 

D5G-AD6 
D56-AC11 
D3S-39 
DS8-BH4 

D56-ADS 

D35-39 
D5B-BH4 



ATGCTT^GGAGTGCGAGXATAGTGCGCGTCAGCTACCTAACTTGTATTTATAGATTCCAA 

ATGCTrTGGAGTGCGAGTATAGTGCGCGTCAGCTACCTAACTTGTATTTATAGATTCCAA 

ATGCTTTGGAGTGCGAGTATAGTGCGCGTC^GCTACCTAAGTTGTATTTATAGATTCCAA 

ATGCTT TGGAGTGCGAGTATAGTG CGCGT CAGCTAC CTAAC C TGTATTTATA.GATT C C AA 
***************************************** ****************** 

GTATATGCTGGGTCTGTGTCCAGAGTAGCATGA 
I 

GTATATGCTGGGTCTGTGTXCAGAGTAGCATGA 

GTATATGCTGGGTCTGTGTTCAGAGTAGCATGA 

' GTATAT GCTGGGT CTGTGT T CAGAGT AGCAT GA 
******************* ************* 



Group 17 

D73A-AD6 
D70A-BA11 
P 7 OA- BBS 

' D73A-AP6 
D70A-BA11 
D70A-BB5 

t) 73 A- ADS 

D70A-BA11 

07OA-BB5 

D73A-AD6 

D70A-BA11 

D70A-BB5 



CTGAATTTTGCAATGTTAGAGGGAAAAAXGGCACTTGCATTC^ 

CTGAATTTTG caatgttagaggcaaaaatgg CAC TTGCZAT tgattctacaacactatgct. 

Ml | 111 I I II! I I MM. 

AAT AATT T T GCAATGT T GGAAACTAAGAT TGC CT TAG CAAT GAT C CTACAG CGTT T T G CT 
************** ** * ** ** ** * *** **** ***** * * **** 

TTTGAGCTCTCTCCATCTTATGCA^ 

TTTGAGCT CTCTC CAT CTTATQCACACG CT C CTCATACAATTAT CACT CTGCAACCTT CAA 
TTCGAGCTTTCTC 

** ***** *********** ** ** ** *** * * ******* * ****** 

CATGGTGCTCCTTTGATTTTGCGCAAGCTGTAG : 7" 

TGTGGTGC T CAC TTAAT C T TGCAAAAAT TATAGGT C CTTAAT CT GGArTT C CCAT TATT G 
******** ** ** **** ** * *** 

TTTTTTTTTTTTTTTTTT7TTTTTTTTTTTT77TTTTTTTTTTTTT 
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Group 18 

D70A-AB5 
D70A-AA8 

D70A-ABS 
D70A-AAB 

D70A-AB5 
D70A-AA3 

D70A-.AB5 
D70A-AAB 

D70A-AB5 
D70A-AA8 " 



AGCGAkGGGGTGGCAA&GGCAAC2^ 

CCGCGAAAATGCATTGGGC^U^AACTTCGCGATTT^ 
CCGCGAAAATGCATTGGGCAAAACTTCGCGATT^ 

****^»**** ******************************************** *^ 
ATTCTACAACGCTTCTCCTTCGAGCTCTCCCCATCTTA^ 
. ATTCTACAACGCTTCTCCTTC<^ 

***************************** ************************ ******* 
GTCACTTTGAAACCC^U^TATGGTGCTCCCCTAAT^^ 

GTCACTTTGAAACCCAAATATGGTGCTCCCCXAATAATOCACAGGCTGTAGTCCTGT 

********************************* ************************* 

AATATG CTAT C CGAGGAATTCAGTT C CT 

i 1 1 ] i 1 1 1 1 1 1 1 1 1 f f 1 1 ] 1 1 1 1 1 ! J 1 1 



Group 19 

D70A-AB8 
D70A-BH2 . 
P70A-AA4 

D70A-ABS 
D70A-BH2 
D70A-AA4 

D70A-ABS 
D7QA-BH2 
D70A-AA4 



C&A&AT TT TGCCATGTTAGAAGCAAAGATGG CTCTGT CTATGATCCT GCAACGCTTCTCT 

111 III I I. I I I 

AT AAA CTTTG CAAT GACAG AAG CGAAGATGG CT ATGG CT ATGATT CTG CAAC GCT T CT C C 

ATAAACTTTGCA^T^ 

******.**** .** ** ******* **_*********** 

TTT GAACT GT CT C C GT CTTAT G CACATGC C CCT CAGT CCATiVT TAAC C G T - CAGG CA CAA 

II | | i I I I II I I I 

TTTG AG CXAT C T C CAT CT TACACACAT G CT CCA CAGT CT GTAATAACTAT GCAAC C C CAA 

TTTGAGCT AT CT C CATCTTACACACATG CT C CACAGT C TGTAAT AACTAT G CAAC C C CAA 
*****.*/*****.***** .******* **-***** .**.«** * **_**_**. 

TAT GGTG C T CCAC T TAT TTT C CACAAG C T ATAATTT GGTACT T G T GAAAGGA CAAT ATGT 

i i i i ii i . miiiiiiiiiiiiiiiiiiiiiiii 

TATGGTGCTCCTCTTATATTGCACAAATTGTAA 



TATGGTGGTCCTCTTATATTGCACAAATTGTAA- 
***********:*****:** ** **- *.**- 
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p/c -77 

CaW^CTTTGCAATGATGGAMCAAAAATGGCAG 

CAAAACTTCGCAATGA^GGAAG CAAAAATGG CAGTAG C TAT GAT AC T ACAT AAATT T T C C 
i IN .1 ill II I I UN I 

caaaattttgc gatgttagaagcaaagactactttggctatgat c ctacaacgct t gtcc 
#irx » y ***** *** * ******** * • * * ******** ***** ** ** + 

TTTGAACTATCCCCTTCTTATAC^C^TO 

TTTGAACTATCCCCTTCTTAT^^ 

III I 1 1 1 1 1 I I ! I I I I I 

TTTGAACTGTCTCCATCTTATGC^C^TGCTCCTCAGTCCATAATAACrTTGCAACCCCAG 
******** ** ** ****** ********** * ** * *** * ** ** *** 

TATG GT GOT C CT CTG CTTATG CG CAGAC T TTAA 

TATGGTGCTCCTCTGCTTATGCGCAGACTTTAA 

| M I M I I I I 
TATGGTGCTCCACTTAXTTTGCATAAAATAXAG 

*********** ** ** *** * * * ** 



TAT AG CT TGGGGCT CAA'GGAGAT TCAAG CTAGCTTAGC TAAT CTT C TACATGGAT T TAAC 
(LtAMTTGKXSGCTC^ 

TATAGCATGGGGCTCAAGGCGAT TCAAGCTAGC T TAG C TAATCTT CTACAT GGATT TAAC 
************ *************»«********************-**** 

TGGTCATTGCCTGATAATAXGACTCCT^ 
TGGTCATTGCCTGATAATATGACTCCTGAGGACCT 



TG GTCATTGCCTGAT AATATGACTCCT GAQGAG CTCAACATGG2V3? GAGAT ' 



TTTTGGGCTC 
**************************** 



t *************************** *■*«* 
T CTACACCTAAAAAATCTCCACTTGCTAC 

TCTACACCTAAAAAATTTCCACrTGCTACTGTGATTGAGCCAAGACTTTCACCAAAACTT 

T CTACACCTAAAAAATTT CCACTTGCTACTGTGATTGAG CCAAGACTTTCACCAAAACT T 
*^ T *»*^********^******************«**********************^* 

TACTCTGTTTGA 

TACTCTGTTTGA 

TACTCTGTTTGA 
************ 



WO 03/078577 



30/31 



PCT/US03/07430 



Group 25 

D58-AA1 

B185-3G2 

D1S5-BC1 

D58-AA1 

D185-BG2 

D185-BC1 

D5S-AA1 
D185-BG2 

mas-BCi 



TTGGGCTTGGCAACGGTGCATGTGAATTTGATGTTG^ 

TTGGGCTTGGCAACGGTGCATGTGAATTIG^^^ 

TTGGGCTTGGCAACGGTGCATGTGAATTTGATGTTOT 

^+r****+ + ±*+**+-**~***********'** 1 *'* :>t ***' k *'**** *-**■«-*■******* **•* 
TGGTCCGCTTACCCGGA3AaJ3W3GAAAGTGGA 

TGGTC CGCTTACCCGGAAAATAGGAAAGTG- GATTTACrGAGAAATTGGAA"CTTACTGTG 

TC^TCCGCTTACCCGGAAAATAGGAAAGTGGATTTTACTGAGAAATTGGAATTTACTGTG 
*^*^*****************~******** . :**********+*+************** 

GTGATGAAAAATCCTTTAAGAGCTAAGGTCAAGCCAAGAATGCAAGTGGTGTAA 

1 1 j 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 ) 1 1 1 
GTGA ii TT77TT7TTTTTTT.TTTTTTTTTT7TTTTTTT7TTTTTTTTTTTTTTT 

GTGATGAAAAACCCOOTAAGAGCTAAGGTCAAGCCAAGAATGCAAGTGGTGTAA 

**** 



Group 2 8 

D177-BF7 
.D185-BD2 
D13S-BE1 

D177-BP7 
D185-BD2 
D135-SE1 

D3/77-BF7 
D185-BD2 
DiaS-BBl 



ATCACATTTGCTAAGTTTGTQAATGAGCTAGC^^ 

AT CACATTTG C TAAG TTT GTGAATGAG C TAGCATT GG CAAGAT TAATGT T C CAT T T TGAT 
ATCACATTTGCTAAGTTTGTGAATGAGCTAGCACT 

******** *****^*****+********^^************** **************** 

TTCTCGCEACCAAAAG<3AGTTAAG 

ttctcgctaccaaaaggagttaagcatgcggaottggacgtggagg^ 

TTCXCGCTAC CAAAAC^AGTTA^GCATGAG<^TTTGGACGTGGAGK2AA^ TG CTGGAATT 
******************+*********- ***************"******** ******** 

ACTGTTAGAAGGAAGTTCCCCCrTTTAGCCGTCGCCACTCCATGCTCGTGA 

ACTGTTAGAAGGj^GTTCCCCCTTTTAGCCGTCGCCACTCCATGCTCGTGA 

ACTGTrAGGAGGAAGTTCCCCCXTTTAGCCGTCGCCACTCCATGCTCGTGA 
********.*********************•********************* 
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Group 3 0 

D70A-AA12 
D17S-BF2 

D70A-AA12 
D17S-BF2 

D7QA-AA12 
D176-BF2 



ArGTCATTTGGTTTAGCTAATCTTrACTTACC^TTGGCTCAATTAOTCTATC^CXT'rGAC 

1 | 11111 II I I I 

ATATGAT TTGGTTTGGCTAATGTTTATTT G CCACTAG CT C AATTGTT ATATC^TTTTGAT 
******* **** ** *** * ******** * *•»*** ***** - 

TGGAAACT C CCAACCGGAATC AAGC CAAGAGACTTGGACTTGACCGAATTAT CGGG AATA 

1 I II I III II * I I 

TGGAAACTCCCTACTGGAATCAATTGAAGTGACTTGGACATGACTGAGTCGTCAGGAGTA 
*********** ** ******** **** ********* **** ** * ** *** ** 

ACTATTGCTAGAAACX^TGACCTTTACT 

II . i ii i M .: I I i 

*** ********** **** '» ***** * *********. ******* ***** **** 



D70A-AA12 
D17S-BF2- 



TAA 
I 

TGA 
t- * 



